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Abstract 



The capabilities of a new approach towards the foundations of Statistical Mechan- 
ics are explored. The approach is genuine quantum in the sense that statistical 
behavior is a consequence of objective quantum uncertainties due to entanglement 
and uncertainty relations. No additional randomness is added by hand and no 
assumptions about a priori probabilities are made, instead measure concentra- 
tion results are used to justify the methods of Statistical Physics. The approach 
explains the applicability of the microcanonical and canonical ensemble and the 
tendency to equilibrate in a natural way. 

This work contains a pedagogical review of the existing literature and some new 
results. The most important of which are: i) A measure theoretic justification 
for the microcanonical ensemble, ii) Bounds on the subsystem equilibration time, 
iii) A proof that a generic weak interaction causes decoherence in the energy 
eigenbasis. iv) A proof of a quantum H- Theorem, v) New estimates of the 
average effective dimension for initial product states and states from the mean 
energy ensemble, vi) A proof that time and ensemble averages of observables are 
typically close to each other, vii) A bound on the fluctuations of the purity of a 
system coupled to a bath. 



This work is dedicated to Kathrin and Meggy, 
the two most important persons in my hfe. 



A philosopher once said "It is necessary for the very existence of 
science that the same conditions always produce the same results." 
Well, they do not. 



Richard Feynman, The Character of Physical Law 
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Hilbert spaces % Hs ^Hb 'Hr,... 

Hamiltonians J^, J^S: b-, ^^sb,--- 

eigenvectors I ^Jjfc), 1^;;), \Em),... 
eigenvalues Ek, Ei, Em,... 

observables and projectors 



quantum states 

pure states ip, (p & 'Pi('H) 
mixed states p, a E AiiTi) 
reduced states/marginals = TtbIp] G M{lis), P^ = T^^sip] e M{Hb) 
time averaged/dephased states oj — {pt)^ — $[po] '.— 2_j \^k) (EklPolEk) {E^] 




observables A, B, . . . 
projectors 11 
rank n projectors Pn(^) 
all projectors V{H) 




k 



trace norm 



||p||i = IV|p|=IV[v^] 



(0.0.1) 
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Hilbert space norm 
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= y/T^WA (0.0.6) 
operator norm of a hermitian operator A 
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Ve7'i(«) 

Von Neumann entropy 
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1. Introduction 



Despite being very well confirmed by experiments Thermodynamics and classical 
Statistical Physics still lack a commonly accepted and conceptually clear founda- 
tion. 

The reason for this unsatisfactory situation is that physicists have not yet suc- 
ceeded in finding concise and convincing justifications for the fundamental ax- 
ioms of Statistical Physics. An overview of the attempts to axiomatize Statistical 
Physics and Thermodynamics and to justify the axioms from classical Newtonian 
Mechanics and the conceptual problems with these approaches can be found for 
example in [1] and |2] and the references therein. 

Quantum Mechanics claims to be a fundamental theory. As such it should be 
capable of providing us with a microscopic explanation for all phenomena we ob- 
serve in macroscopic systems, including irreversible processes like thermalization. 
But, its unitary time evolution seems to be incompatible with irreversibility [3] 
leading to an apparent contradiction between Quantum Mechanics and Thermo- 
dynamics. This apparent contradiction is part of the long standing problem of 
the emergence of classically from Quantum Mechanics. 

To overcome this problem many authors have suggested to modify Quantum 
Theory, either by adding nonlinear terms to the von Neumann equation or by 
postulating a periodical spontaneous collapse of the wave function [4J. Others 
have considered effective, Markovian, time evolutions for open quantum systems 
[5] and it has been shown that system bath models that evolve under a special 
form of Hamiltonian tend to evolve into states that are classical superpositions 
of so called pointer states — a phenomenon called environmentally induced super 
selection, a term due to Zurek |6]. Depending on the author subsets of these 
approaches are subsumed under the term decoherence theory jZl El El E] . 

In face of the enormous success of standard Quantum Mechanics in explain- 
ing microscopic phenomena and the additional difficulties that arise when the 
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von Neumann equation is modified and the existence of macroscopic quantum 
systems on tlie one liand, and tlie broad applicability of Statistical Mechanics and 
Thermodynamics on the other, we feel that neither a modification of Quantum 
Theory, nor considerations restricted to special situations can provide a satis- 
factory explanation of the statistical and thermodynamic behavior of our macro- 
scopic world. Consequently we will seek to derive general statements independent 
of particular models and we will not use the Markov assumption. Furthermore, we 
believe that neither the assumption of ergodicity nor classical or quantum chaos 
are good starting points for constructing a convincing and consistent foundation 
for Statistical Mechanics and Thermodynamics (see for example footnote 1 and 
2 in [10]). 

The struggle for a quantum mechanical explanation of behavior usually de- 
scribed by Statistical Physics dates back to the founding fathers of Quantum 
Theory, most notably von Neumann [H] and Schrodinger jT2]. Recently work on 
this subject was resumed and there has been remarkable success: 

• In [T3| [T0| [TU [151 CHI [13 [H] a justification for the applicability of the canon- 
ical ensemble is given that does not rely on subjective, added randomness 
or ensemble averages. While [TU| [TU [T7] make particular assumptions on 
the Hamiltonian and introduce the concept of temperature, and thereby are 
able to derive explicitly the Boltzmann distribution, the aim of [13| [T5| [T6] 
is more to show that the reduced states of random states of large quantum 
systems typically look like the reduced state of the microcanonical state, 
|18j in addition uses time dependent perturbation theory. All these works 
are based on typicality arguments and the phenomenon of measure concen- 
tration [19j5 

• In [22| [23l [211 [251 126] it is shown how seemingly irreversible, thermodynamic 
behavior of macroscopic systems can be explained in the framework of stan- 
dard Quantum Mechanics and that the approach proposed in [T3| [271 [T5] is 
capable of explaining the phenomenon of equilibration in a natural way. 

^It is very interesting to compare thees articles with the works of Jaynes |20| |2I] Although 
there are huge differences concerning the interpretation, the before mentioned works are 
methodologically very close to certain aspects of the approach of Jaynes, especially with 
respect to the way they make use of measure concentration arguments. It is thus surprising 
and unfortunate that Jaynes' works have been completely ignored in the recent literature. 
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• There are some works that investigate equihbration and thermahzation in 
particular models [28| [29l l30[ |311 132]. Due to the additional structure in the 
less general situations considered in these works a more detailed analysis 
is possible and the authors can make assertions about the time scales on 
which equilibration happens. 

• In |33] it is shown how the concepts of work and heat can be defined on 
purely microscopical grounds without using classical external driving and in 
|34| the limits of purely quantum microscopic thermal machines are investi- 
gated. See also the references in [531 IM] for works discussing and applying 
definitions of work and heat based on time dependent Hamiltonians and 
external driving. 

• In 121] it is shown that a slow and continuous evolution of expectation 
values is typical for large quantum systems. 

• In [TT| it is shown that large quantum systems typically are in some 
sense ergodic with respect to coarse grained, macroscopic observables. Ref. 
|TT| was criticized in |37[ 138]. but recently it was argued that this criticism 
was unjustified [36j. 

• There have been attempts to derive the Second Law of Thermodynamics 
[391 Ho] or a statistical H- Theorem [H] for the von Neumann entropy from 
Quantum Mechanics and in |1T] (see also the older references 4 and 5 in [21]) 
a different entropy measure, "microscopic diagonal entropy", was proposed 
to overcome the contradiction between microscopic time reversal invariance 
and the Second Law. 

• In addition to the mainly analytical works cited above there exists a quickly 
increasing amount of numerical works concerned with equilibration and ther- 
modynamic behavior of open Quantum systems confirming the analytical 
findings [271 SSI Hal mill]. 

Unfortunately the often mathematically rigorous and far reaching results of 
these works are almost complete ignored by textbooks on Statistical Mechanics 
and ThermodjTiamics, this is true even for the results obtained by von Neumann 
in 1930 [11] (an exception is [8j). This situation is unfortunate since some of the 
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results mentioned above address long standing conceptual issues at the very heart 
of Statistical Mechanics and Thermodynamics. 
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Figure 1.1.. Many of the phenomena correctly described by thermodynamics, which 
is a mainly phenomenological and very applied theory, can be under- 
stood within the framework of Statistical Physics. Both theories have a 
very high degree of corroboration and have proved to be extremely use- 
ful. In the conventional approach (a) the methods of Statistical Physics 
are "derived" from Newtonian Mechanics and an additional layer of pos- 
tulates and assumptions that introduce statistical concepts and ensure 
equilibration. These additional assumptions are quite questionable and 
have provoked quite a lot of debate. The irreversibility introduced by 
postulating the Second Law of Thermodynamics contradicts the time 
reversal invariance of Newtonian Mechanics and it is still not known 
whether thermodynamic systems typically are (quasi) ergodic. The rad- 
ical, though natural approach pursued in this work (b) is to replace 
Newtonian Mechanics by Quantum Mechanics in the hope of getting rid 
of all extra assumptions. 
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Especially |THl HHl 1221 HSl [ISl E] argue for a new interpretation of the foundations 
of Statistical Mechanics. Following Seth Lloyd |13| we called this approach pure 
state quantum Statistical Mechanics. In what follows we give a concise and self 
contained review of the results of these and other related works in a unified and 
consistent notation. In the first section we introduce the general setup and fix the 
notation. We then review the recent progress in the field and present additional 
new results concerning the justification of the applicability of the microcanonical 
and canonical ensemble, equilibration, ergodicity and initial state independence. 
Finally we show that these results imply a statistical quantum Second Law of 
Thermodynamics. 

2.1. Setup 

We consider arbitrary quantum systems that can be described using a Hilbert 
space Ti of finite dimension We assume that all observables, including energy, 
are bounded linear operators, i.e have a finite operator norm. 

We will often talk about systems that can be divided into two parts, which we 
will call the bath B and the subsystem S, such that l-L = 'Hs®'^b where T-is and 
"H B are the Hilbert spaces of the subsystem and the bath respectively. It shall be 
emphasized that we will not make any special a priori assumptions about the size 

^If the Hilbert space of a real system is infinite dimensional it should always be possible to find 
an effective description in a finite dimensional Hilbert space by introducing a high energy 
cut-off. If eigenstates with extremely high energy had a crucial influence on the behavior of 
realistic systems physicists would be in a desperate position. Without the ability to prepare 
and thus study these states in detail it were very difficult to make reliable predictions. The 
author therefore believes that whenever the behavior of some model is crucially changed by 
introducing such a cutoff this is due to the very fact that it is a model. Moreover, it was 
demonstrated in [29 that many of the phenomena we that can be rigorously proven in the 
finite dimensional case also occur in infinite dimensional systems. We thus believe that the 
restriction to finite dimensions as mainly a technicality. 
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and structure of the bath and system. All results will be completely general. The 
only reason why we call one part the bath and the other the subsystem is that in 
the end we will be interested in situations where the dimension c/^ of the Hilbert 
space Hb of the bath is much larger than the dimension ^5 of the Hilbert space 
Hs of the system. 

We denote by V{'H) the set of all projectors on % and by Vni'H) the set of all 
rank n projectors on %. We write and for normalized pure state vectors 
and use ip and ip to denote their associated pure density matrices in ViiTi). The 
set of all, possibly mixed, normalized density matrices on Ti, i.e. the set of all 
positive-semidefinite hermitian matrices with trace one, will be denoted by A4.{'H) 
and we will use the symbols p and a for, possibly mixed, states from M.{'H). 
Their reduced states, or marginals, on the subsystem and bath are indicated by 
superscript letters like in p"^ = Tr^ p and p^ = Tr^ p. 

The Hamiltonian of the joint system ^ = has d energy eigenstates \Ek) 
with corresponding energy eigenvalues Ek that we will assume to be given in units 
of h. The Hamiltonian governs the time evolution of the joint system. If the initial 
state of the system was po we will denote the state at time thy pt = Ut p U} with 
[/^ = e-'-^*. 

The Hamiltonians considered herein are completely general except for one ex- 
tremely weak constraint, namely that they have non-degenerate energy gaps or 
are non-resonant^ This assumption imposes a restriction on the equality of the 
gaps between energy eigenvalues, namely 

Ek — El = Em — En ^ ^ 

(2.1.1) 

=^ {k = I Am = n) V [k = m Al = n). 

Note that there are two slightly different versions of this assumption: In the first, 
stronger version the indices k, /, m, n run over all eigenstates of the Hamiltonian, 
i.e. /c, l,m,n & {1, d}. This version implies that the spectrum of the Hamilto- 
nian is non-degenerate. In the weaker version the indices run only over all distinct 
eigenvalues, so that degeneracies in the energy spectrum are allowed as long as 
the gaps between the degenerate subspaces are non-degenerate. 

It shall be emphasized that even the stronger version is an extremely weak 

^This assumption already appears in the work of von Neumann [TT] and later in |251 12^ 155] 
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restriction as every Hamiltonian can be made to be non-resonant by adding an 
arbitrary small random perturbation. Generic Hamiltonians have non-degenerate 
energy gaps. Every Hamiltonian becomes non-degenerate by adding an arbitrary 
small random perturbation; therefore the Hamiltonians of macroscopic systems 
can be expected to satisfy this constraint. 

The physical implication of this assumption is that the Hamiltonian is fully 
interactive in the sense that there exists no partition of the composite system 
into a subsystem and bath such that the Hamiltonian can be written as a sum 
= ,^s®^ + 1 (8) B where J^s and b act on the subsystem and bath 
alone. 

In the following we will use the stronger version of the non-degenerate energy 
gaps assumption for the sake of simplicity. However, results similar to the ones 
presented herein hold under the second, weaker version. Basically, what one has 
to do is replace projectors onto energy eigenstates \Ek){Ek\ by projectors onto 
degenerate subspaces and refine some of the quantities appearing in the theorems, 
in particular the effective dimension (see the discussion in 

The consequence of the non-degenerate energy gaps assumption that we exploit 
in the present work is that time averaging a state pt that evolves under such a 
Hamiltonian ^ 

(pt), := lim - ! ptdt. (2.1.2) 

T->oo r Jq 

gives the same result as dephasing the initial state with respect to the energy 
eigenbasis of ^ 

$[po] ■.= Y,\Ek){EMEk){Ek\. (2.1.3) 

k 

We will therefore use the letter u = {pt)t = $[po] to refer to time averaged and 
dephased states respectively. 

In what follows we will often talk about random pure states drawn from some 
subspace Hr. Unless explicitly stated otherwise by a random pure state we mean 
a state that was chosen according to the Haar measure on Hr, which is the unique 
unitary left and right invariant measure on Vi{1-Lr} [10] (see appendix[B]for more 
information) . 



8 



2. Quantum Statistical Meclnanics 



2.2. Ensemble averages and pure state quantum 
Statistical Mechianics 



In conventional Statistical Mechanics probabilities, expectation values, variances 
and higher moments of observables are computed via ensemble averages. De- 
pending on the situation under consideration one must employ the microcanoni- 
cal, canonical or the appropriate grand canonical ensemble |l7j. The validity of 
this approach is beyond all doubt and the results obtained using it have been 
confirmed by innumerous experiments. 

On the other hand, the role of probability |^ |2T] in Physics, the problem of 
ergodicity and especially the microscopic justification of the Second Law of Ther- 
modynamic are very subtle issues and many fundamental questions concerning 
them are still open despite many decades of research [Tj. 

The starting point of our discussion will be to show how the applicability of 
ensemble averages can be justified using Quantum Mechanics and measure con- 
centration techniques without any extra assumptions. 



The microcanonical ensemble is in some sense the most fundamental ensemble. 
In classical Statistical Physics it is applied to closed systems in equilibrium. The 
other ensembles, canonical and grand canonical can be derived from it |47) . 

In the quantum setting the microcanonical ensemble is used in situations where 
all one knows about a closed physical system is that the value of some observable 
A, which corresponds to a conserved quantity, i.e [J^, A] = 0, lies in some interval 
/|^ Let I a) be the eigenvectors of A and T-Lr the restricted subspace spanned 
by those eigenvectors that have eigenvalues in the interval. The microcanonical 
expectation value of any observable B with respect to is then defined to be 



•^Note that thermodynamically closed does not necessarily mean completely isolated [S]. In 
this section we will however talk only about completely isolated systems. 



2.2.1. The microcanonical ensemble 




(2.2.1) 



where Ur = J2\a)en 



R 



a){a\ is the projector onto the subspace Hr of eigenstates 
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of A with eigenvalues in I. Knowing only that measuring A would give a value 
in / we ascribe to the system the mixed state |^ 



dR 



1 

dR 



E 



a){a\. 



(2.2.2) 



Equation (2.2.1) and (2.2.2) are the quantum version of the equal a priory prob- 



ability postulate, which is the fundamental postulate of convectional Statistical 
Mechanics. All compatible states are assigned the same a priory probability. 

It is beyond all doubt that this approach to calculate expectation values has 
proven to be extremely useful and yields results in good agreement with experi- 
ments. However it remains puzzling why dynamically evolving and intrinsically 
quantum mechanical systems may be described by the static, highly mixed state 
(I2T2I). 



Typicality of general observables 

The recent results suggest that the equal a priory probability postulate is dispens- 



able [16.]. Instead of assuming that the state (2.2.2) yields a good description of 



the system it is possible to proof that for almost all pure states of large systems 



all subsystems behave as if the system were in the state (2.2.2). A statement the 
authors of [16] called General Canonical Principle. 

The idea to reproduce the results obtained using the microcanonical ensemble 
average, without added randomness form nothing but pure Quantum Mechanics, 
and thereby justifying its use, was already discussed in 1991 by J.M. Deutsch 
|45) . A mathematically more precise statement about the equivalence of ensemble 
averages and expectation values of random pure states can be found in the Ph.D. 
thesis of Seth Lloyd which appeared in the same year [13j: 

Theorem 2.2.1. |13] Let 'Hr'^'H be a subspace of dimension (Ir of the Hilbert 
space % of some physical system. Let II/j be the projector onto Hr and let (■)^ 
be the average over random pure states ip G Vi{'Hr). Then for every observable 



■^Note that there are other possible generalizations of the microcanonical ensemble to the 
quantum setting that are discussed in the literature (s. [5U1 ICTj l. 
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B with [B, Ur] = 



<(Tr[5^] - {BUr)^ = ^((5^)^, - {B')l) < (2.2.3) 



The interpretation of theorem 2.2.1 is straight forward: If the dimension dR of 
TIr is large, it tells us that the mean square deviation the expectation value of 
Tr[i?'?/'] computed over random pure states G Vi{'Hr) from the microcanonical 
expectation value (B)^^ is small, which implies that the two expectation values 
will be similar with high probability. 

The methods used in [IS] to proof the General Canonical Principle, namely 
Levy's lemma (see appendix [C]), can be used to proof a stronger, exponential 
bound on the probability to observe a deviation from the predictions of the mi- 
crocanonical ensemble when measuring an observable acting on the full Hilbert 
space: 

Theorem 2.2.2. Let Hr H be a suhspace of dimension dR of the Hilbert 
space "H of some physical system. The probability that the expectation value of an 
arbitrary observable B in a randomly chosen pure state G Vi{'Hr) differs from 
its microcanonical expectation value with respect to Hr is exponentially small in 
the sense that for every e > 



Pr{|Tr[5V] - (5)^J > e} < 2 e (2.2.4) 
where C is a constant with C = (36 vr'^)^-'^. 

Proof. The proof is almost completely analogous to a proof in appendix VI of [15j 
and relies on Levy's lemma (s. appendix [C]) . For an arbitrary fixed observable B 
we define the function 

fBW=TT[Bij]. (2.2.5) 

The expectation value {fsi'^P))^ of this function with respect to a randomly chosen 
pure states ip G Hr clearly is 

{fsW)^ = mSi^])^ = Tr[B iij)^] = Tt[B ^] = {B)^^ . (2.2.6) 



^The additional constraint [B, Hji] = is not discussed in the main text of but it is stated 
and used in the proof the theorem. 
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Its Lipschitz constant rj with respect to the Hilbert space norm is upper bounded 
by 2||5||oo, as [I5]: 



|/B(^i)-/ij(V'2)| = |Tr[5(^i-^2)]| 

< ||i?||oo 111^1) + 1^2)112 lll^l)- 1^2)112 
<2||5|U|||V^l)-|V^2)||2 



(2.2.7) 



Applying Levy's lemma (see appendix [C]) to fsiip) gives the desired result. □ 



Theorem |2.2.2| tells us that as (Ir becomes large the set of states ip for which 
Tr[i?'0] deviates from (B)^^ by at most a given amount becomes exponentially 
small. Typical states will give expectation values that agree very well with the 
predictions of the microcanonical ensemble. 

Of course, typicality of expectation values is not sufficient to justify the micro- 
canonical ensemble from measure theoretic considerations. Variances and higher 
moments also need to be considered. 



In [13] it is claimed that theorem 2.2.1 implies that not only the expectation 



values, but in addition all higher moments are likely to be close to the micro- 
canonical ones for typical states. But what is actually proved is that the variance 
in state computed with respect to the microcanonical expectation value 



Tr[(5-(B)^JV]. 



(2.2.8) 



is close to the microcanonical variance 



(2.2.9) 



with high probability given that dji is large. The additional deviation caused by 



the fact that (2.2.8) differs from the variance in state tjj 



=Tr[(i?-Tr[5^])2^] 



(2.2.10) 



is not taken into account. 

But, as one may already anticipate, the additional error typically is very small. 



so that it is not surprising that theorem 2.2.2 can be used to proof that not 
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only the expectation values, but in addition the variances of almost all states are 
compatible with the variance of the microcanonical ensemble. We expect that 
similar statements hold for all higher moments. 

In particular we can proof that: 



Theorem 2.2.3. Let C'H be a subspace of dimension of the Hilbert space 
of some physical system. The probability that the variances of some observable 
B in a random pure state ip eVi {%r} 

al^Ti[{B-TY[B^]f^] (2.2.11) 

differs from the variance that follows from the microcanonical ensemble 

is exponentially small, in the sense that for every e > 

Pr{kJ-^mcl > \\B\Le] 

< min 2 e-(^'^«(^-'5))+2 e'^^'^^'^') (2.2.13) 

0<S<e ^ ' 

< 4g-CdH(i+2.-^i+4i) (2.2.14) 



where C is a constant with C — (36 tt ) 



3\-l 



Proof. Let /i^ and /Xmc be the n-th moment of the probability distribution of 
the observable B with respect to the state ip and the microcanonical ensemble 
respectively, so that in particular /x^^ = cr^ and //mc = c^c- simplify the 
notation we define 

^^;^) = \^f-T:r[{B~{B)^Xn (2.2.15) 
A(") = I Tr[(S - (i^)^J" V'] - A (2.2.16) 
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For all e > we have: 



Pr{|/^;"^-/iL1l>e} 

< mm Pr |aS"^ > e - 5 V aJ"^ > (2.2.17) 

< mm Pr | aS"^ > e - + Pr |a^"^ > 5^ (2.2.18) 



The second term in the last line can be bounded. Applying theorem |2.2.2| to 

5„ = (i? - {B)^J- gives 



Pr I A^"^ > 5| < 2 e Ml . (2.2.19) 
This is an exponential version of the bound found in [13]. 



Bounding the first term is in general more complicated except for the variances 
where we can use the following argument: Assume that the deviation between 
Tr[B^|J] and {B}^^ is 



then 



e' = TT[Bij]-{B)^ 



(2.2.20) 



= Tr[(i? - TT[B^]y^] = Tt[{B - ((5),,, + e'))^^] 
= Tr[(i? - {B)^f^] + e" -2e' Tr[(i? - {B)^J ^] 
= Tr[(5-(5U^^]-e'^ 



(2.2.21) 
(2.2.22) 



so that 



Tr[i?^]-(i?LJ<6'^Ap)<e'2 



and therefore we have by theorem 2.2.2 for all < 5 < e 



PrjAf^ > e-(5| 

< Pr 1 1 Tr[^ B] - {B)^^ \ > 



_Cdjf^(e-6) 

< 2 e iisii§o . 



(2.2.23) 



(2.2.24) 
(2.2.25) 
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Combining the two estimates we arrive at: 

Pr\\al - alA > e| < min 2 e +2 e iK^-WmJ^iiio (2.2.26) 

Now, every observable can be renormalized such that (B)^^ = and rescaled such 
that its operator norm is one. Doing this one changes the variance by a factor of 
ll-^lloo th.at we get 

Pr{|aJ-aLl > e} < min 2 e'^^-^^^^-^)) +2 e-^^"^-^') . (2.2.27) 

Substituting S = l/2(-\/l + 4e — 1) gives the second bound. □ 

Note that all important steps in the above discussion are valid also for higher 
moments except for the bound on PrjA^"'' > 6}, which is especially simple for the 
special case n = 2. We expect however that slightly more complicated arguments 
can be made for all higher moments. 

Measuring the same typical pure state of a large enough quantum system we 
therefore can expect to not only get expectation values that are close to the 
microcanonical ones but in addition the observed variances will be almost identical 
to the ones predicted by conventional Statistical Mechanics. Note that these 
variances are caused by objective quantum uncertainties and not by ensemble 
averages due to a subjective lack of knowledge of the micro state. 

Concluding we may say that, given an ensemble of large quantum mechanical 
systems we are, by measure only a reasonably small number of observables, with 
very high probability, unable to decide whether all systems of the ensemble are 
in the same random pure state choose from some subspace, or representatives 
of the corresponding microcanonical ensemble. We call this property of large 
quantum systems microcanonical typicality. However, there are combinations of 
initial states ipQ and observables B that give a measurement statistic that deviates 
radically from the predictions of the microcanonical ensemble. This happens for 

^The interpretation of the word objective depends on the preferred interpretation of Quan- 
tum Mechanics. A discussion of this point (that conies to the conclusion that Quantum 
Mechanical probabilities are not objective in a certain sense) can for example be found in 
|21) . However, they are certainly in some sense more objective than probabilities that result 
form the voluntary dismissal of information due to coarse graining. We shall not elaborate 
on this point here as it would lead us to far away from the subject of this work. 
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example when i/jq is an eigenstate of B. These measurements are the ones that best 
characterize the system under consideration and an experimentahst will always 
seek for such a characterization. Thus the physical significance of the above results 
is questionable. 

In the following sections we will elaborate more on this point and present argu- 



ments similar to theorem 2.2.2 for coarse grained observables and for situations 
where only a subsystem of a larger quantum system is experimentally accessible 
and we will see that in these situations the criticism expressed above does not 
apply. 



Typicality of coarse grained observabies 

We have seen that when all observables are experimentally accessible there always 
exist measurements, in particular measurements in the eigenbasis, which give a 
measurement statistic for a random pure state that deviates radically from the 
one predicted by the microcanonical ensemble. 

However, on macroscopic systems most observables are not accessible. This is 
not only a consequence of experimental limitations but manly due to the vast 
number of dimensions of the Hilbert spaces of macroscopic systems [TTl [2lj . As 
an example consider the spin degrees of freedom of a macroscopic magnet. The 
typical Hilbert space of such a system has a dimension of the order of 2^°^^. Trying 
to measure an observable that can distinguish that many states, or even worse, 
doing state tomography on such a system, certainly is a completely futile task. 

Obviously we need to find a way to take our limited capabilities into account 
when seeking a realistic description of macroscopic systems. The way we will do 
that here is the simplest and most straight forward one can possibly think of and 
similar considerations date back to the work of von Neumann 

Let M = {Mi} be the set of experimentally accessible macro observables 
Mi, where, without loss of generality we can assume that the Mj are positive- 
semidefinite Mj > and have trace one TrMj = 1. We think of the Mi as 
macroscopic observables, so that, due to the limited resolution of our measure- 
ment apparatuses, the Mj will be highly degenerate. Furthermore we want the 
Mi to be classical in the sense that [Mi,Mj] = 0. Such a set M of commuting 
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observables induces a pseudo norm and an associated pseudo trace distance 

Vm{p, (t) = max TtIMJo - a)] (2.2.28) 

which measures how well two states p and cr can be distinguished from one another 
by the restricted set of observablesj^ The set of accessible measurements partitions 
the total Hilbert space Ti of the system into a complete set of m orthogonal 
subspaces {"H^} with "H^ = "H of macro scopically distinguishable states, or 

macro states, such that states from one subspace can not be distinguished by 
any of the Mj and that two states are distinguishable by at least one of the Mj 
whenever they are in different subspaces: 

VH, \fp,a eViCHr) pAf(p,a) = (2.2.29) 

^ns^pe -Hr, a e ViiUs) VMip, fx) > (2.2.30) 

Every macroscopic observable A that we can measure by using all our measure- 
ment capabilities is of the form 

m 

A = '^arllr (2.2.31) 

r=l 

where the 11^ are the projectors onto the corresponding subspaces Hr and the ar 
real parameters. 

In realistic situations we can expect that m <^ d and the following theorem 
tells us that we are unlikely to have any chance of distinguishing a random pure 
state from the microcanonical state under these conditions: 



Theorem 2.2.4. Let 1-Lr C 'H be a restricted subspace of dimension d^ of the 
Hilbert space T-i of some physical system. Assume that the physically feasible, 
macroscopic measurements allow one to distinguish a total number of m macro 
states. Then the probability that a random pure state ip G Vi{'Hr) gives an expec- 
tation value for any of the accessible macroscopic observables A that differs from 

^Note that Pm(', •) reduces to the normal trace distance if M ~ M{H). See appendix [a| for 
more information on distance measures for quantum states. 
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that of the microcanonical one with respect to 1-Lr is exponentially small, namely 
Pr |max|Tr[A?/'] - (A)^J > e| < 2m e ii-^ilL , (2.2.32) 
where C is a constant with C = (36 vr''^)^^. 

Proof. The proof is inspired by the considerations in appendix VI of [15j. As 
explained above M defines a set of mutually orthogonal projectors 11,. onto sub- 
spaces of indistinguishable states and consequently every accessible observable is 
of the form 

m 

A = ^arllr (2.2.33) 

r=l 

SO that ll^lloo = maXr la^l. Obviously for all such observables it holds that 

m 

I Tt[A^] - (A)^^^ I < ^ |a,(Tr[n, V^] - (n,)^J| (2.2.34) 

■r=l 

< m max|a^||Tr[n^V'] - {^r)^c\ (2.2.35) 



Inserting B = (Xr into theorem 2.2.2 we find that for random pure states 



Pr{|a,||Tr[n,^/;]-(n,)^J>e}<2e (2.2.36) 
where C = (36 tt^)"^. Using the union bound we see that this implies that 



Pr{3r: | Tr[n^ t/-] - (n^)^J > e} < 2 m e (2.2.37) 
so that for all accessible observables A 



Pr{|Tr[A^] - (A)^J > me} < 2m e . (2.2.38) 



□ 



The important quantity in the above theorem is the quotient dR/m in the ex 



ponent of (2.2.32) which quantifies how good our abilities to prepare and measure 



a state are. Assuming that the dimensions of each of the subspaces of indistin- 
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guishable states are approximately identical one can expect that d/m and d 

grows exponentially with the number of constituents of the system. In contrast m 
is basically given by the spread of the spectra of the physically accessible observ- 
ables divided by the resolution of the measurement apparatuses. The spread of 
the spectra can be expected to grow at most polynomial with the system size and 
the resolution of the measurement apparatuses will be roughly independent of the 
system size. One can therefore expect that for large enough systems one enters 
the regime where dji ^ m? and where the above theorem becomes meaningful. 



In contrast to theorem |2.2.2[ which we have criticized for being of limited 
significance, as there always exist observables capable of distinguishing between 
a random state and the microcanonical state, theorem |2.2.4 is a statement about 
all accessible observables. 



2.2.2. The canonical ensemble 

The usual situation in which the canonical ensemble is applied are subsystems of 
weakly interacting composite systems whose total energy is known to lie in some 
narrow interval. A slightly more general situation is that of a composite system 
subject to the constraint that the value of some observable A corresponding to 
an extensive and conserved quantity is known to lie within some interval. This 
understanding of the canonical ensemble includes what is sometimes called the 
grand canonical ensemble. For the sake of simplicity we restrict ourselves to the 
canonical case where A = J^. The generalization to the grand canonical case is 
almost trivial. 

Using the canonical ensemble to calculate expectation values is equivalent to 
assuming that the state of the system of interest is given by the so called canonical 
state 

Pc=\e-^-''^ = \e-^''^\El){Ell (2.2.39) 

where (3 is the inverse temperature, \E^) the eigenstates of the system Hamilto- 
nian ^ s ^-nd 

Z = Tre-^^s (2.2.40) 
the partition sum, which ensures normalization. 



Taking (2.2.39) as the system state is usually justified by regarding it as a sub- 
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system of a larger, closed composite system to which the microcanonical ensemble 
can be applied [TTl |52l |53]|^ The following is a sketch of how this justification 
works. 

The argument presented herein follows closely the discussion in [17j. Note 
that the argument is solely based on combinatorics and the identification of the 
thermodynamic entropy with the entropy defined via the number of compatible 
micro states. There is nothing specifically quantum to it. Very similar arguments 
can be found in nearly every textbook on Statistical Mechanics. 

The Hamiltonian of the composite system 

= ,Jifs®t + l^,JifB + 'y^sB (2.2.41) 

consists of a system Hamiltonian a bath Hamiltonian J^b and an interaction 
term J^sb- The interaction term is assumed to be small in the sense that the 
total energy of the system is approximately the sum of the system energy and 
the bath energy, i.e. that energy is extensive, and that the energy eigenstates are 
close to product states. 

The energy of the composite system is assumed to be known to lie in some 
interval [E, E+AE] that is assumed to be small on a macroscopic energy scale, but 
still large enough such that the subspace J^r spanned by the energy eigenstate 
with eigenvalues in the interval is large. 

Assuming that the composite system is in the microcanonical state and using 
that the energy eigenstates of are approximately product states we find for 
the reduced state of the system 

pt = TTBPrnc (2.2.42) 
^^f2dk{E^)\El){E',\, (2.2.43) 

k=l 

where the \E^) are the eigenstates of with energy E^ and the dk{E^) are 
the number of eigenstates of J^b with eigenvalues in the interval [E — E'^, E — 
Ei + AE]. 

^Alternatively one plead the Bayesian probability and the principle of maximum entropy prin- 
ciple mi [20]. 
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The last step is to introduce the concept of temperature. The inverse temper- 
ature of the bath is defined via /3 = dS{E^)/dE^ where S{E^) is the entropy 
of the bath when it is held at energy E^. Assuming that the energy levels of 
the bath become exponentially dense with increasing energy, which seems to be 
a reasonable assumption for most thermodynamic systems, one can expect that 
S{E^) ^ log((ifc(i?^))|^ Such that, if the bath is much larger than the system we 
have: 

dkiE"") ^ e^(^-^f) ^ e^(^)-/5^f oc e"^^' (2.2.44) 



So that finally one reaches the conclusion that = Ttb Pmc ~ Pc under the 
given conditions!^ 

Now the question is: Is it possible to come to the same conclusion without using 
the ad hoc assumption of the microcanonical state for the composite system? 
In [13] consequences of theorem 2.2.1 on the equivalence of expectation values 
obtained using the canonical ensemble and expectation values of typical quantum 
states have been already been discussed. Using similar arguments it is shown in 
[371 [TOl [T7] that the reduced state of a typical random state from the subspace 
compatible with the imposed energy constraint will, with high probability, be 
close to Pc- Herein we focus on the more rigorous exponential bounds provided 
by theorem 2.2.2 and the results obtained in [15] . 



Of course theorem 2.2.2 is also applicable to observables that act only locally on 
the subsystem and our considerations concerning variances and higher moments 
also remain valid. Consequently theorem 2.2.2 and 2.2.3| already tell us that 
the measurement statistics of local observables does not differ much whether we 
assume that the composite system is in the microcanonical state corresponding 
to or in one particular random pure state from Hr. 

For reduced states of random pure states an even more powerful statement can 
be proved. This is the main result of [15] : 



'This is probably the most critical step in the argument. The assumption of exponentially 
dense energy gaps conflicts with the assumption that sb does not significantly infiuence 
the eigenstates of the uncoupled Hamiltonian s ®1 + 1 (8) ,j^b, as this can be guarantied 
only when the coupling is smaller than the energy gaps of the uncoupled Hamiltonian. 

'Using a similar argument, but under additional assumptions on the interaction Hamiltonian, 
namely that it only couples adjacent energy eigenstates, the canonical ensemble is also 
derived in |10| . 
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Theorem 2.2.5. (Theorem 1 in [15j)[^ Let I-Lr C y, be a subspace of di- 
mension dpi of the Hilbert space l-i = Hs^'Hb of some physical system. The 
probability that the reduced state = TiBi' of a randomly chosen pure state 
ip G Vi{'Hr) is more than e > away from the reduced microcanonical state 
p^j, = TiB Pmc is given by 



Pr { P(p^pL) > 2e + 2 J4 !^ < 2 e-^'^«^^ (2.2.45) 



with C = (18 Tr^)-^ and 



df 



< = d^'ipt) > ^. (2.2.46) 



Whenever d^ ^ ^5, which is exactly the situation we are interested in, this 



theorem gives a full replacement for the assumption made in (2.2.42 ). If one trusts 
the argument presented above that p^^ ^ p^, this theorem, together with the usual 
assumption of weak interaction, proves that almost every pure state drawn from 
a sufficiently large subspace is locally equivalent to the canonical state. That is, 
there exists no measurement at all by which they can be distinguished. This is a 
measure theoretic justification for the applicability of the canonical ensemble that 
does not rely on the microcanonical ensemble or the equal a priory probability 
postulate. The authors of \15\ call it General Canonical Principle. 



2.3. Average effective dimension of random pure 
states 

In this section we will discuss the effective dimension 

d-'i.) = (2.3.1) 

where u = $[ijjo] = {i^t)t^ of random pure initial states ipQ drawn according to 
different distributions. This quantity will be important in the following discussion. 
Roughly spoken we will find that a high effective dimension causes thermodynamic 
behavior, while a small effective dimension will make quantum effects observable. 



In many situations theorem 2.2.5 can be further improved. See [TS] for details. 
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Before we go on it is useful to develop an intuitively understanding for the 
effective dimension. Obviously we have d^^lip) = 1 if is pure and the completely 
mixed state has an effective dimension of (F^{l/d) = d. Expanding an arbitrary 
pure initial state ipQ in the energy eigenbasis as follows 

^o = 5^Cfcc;e-'(^'=-^')*|^fc)(^/| (2.3.2) 

kl 

we find that, under the assumption of non-degenerate energy gaps, its effective 
dimension is 

- mm ^ OT- 

Therefrom we see that the effective dimension can be interpreted as a measure for 
the number of energy eigenstates that contribute significantly to the given initial 
state ipQ. This intuition can already serve as a justification for the assumption 
that for macroscopic objects d^^{u) will typically be very large. 

In the remainder of this section we will establish a number of rigorous measure 
theoretic statements supporting this intuition. The considerations will necessarily 
be quite technical. In particular, we will consider states drawn according to the 
Haar measure from subspaces of the total Hilbert space, product states, where 
both tensor components are drawn from subspaces according to the Haar measure, 
and states from the mean energy ensemble. When first reading this work it is 
maybe better to settle with the intuitive argument given above, skip the rest of 



this section and continue reading in section 2.4 



2.3. 1 . States drawn from subspaces 

One of the centrals result derived in [25] is that almost all pure states drawn 
according to the unitary invariant Haar measure from a high dimensional subspace 
have a high effective dimension: 

Theorem 2.3.1. (Theorem 2 in |25| ) i) The average effective dimension with 
respect to a Hamiltonian with non-degenerate energy gaps i^d'^^ {uj)'^ where the 
average is computed over uniformly random pure initial states ipo G VilHn) drawn 
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from some subspace "H^ El-i of dimension d^, is such that 

a) For a random pure initial state i/jq G Viil-L-ji), the probability that d^^iu) is 
smaller than dji/A is exponentially small, namely 

Pr|(i^«^(a;)<^l<2e-^v/^ (2.3.5) 



with a constant C 



ln(2)^ 
72 7r3 ■ 



The above theorem states that whenever one draws a state according to the 
Haar measure form a high dimensional subspace one will almost certainly get a 



state with a high effective dimension. Note that theorem 2.3.1 is a very strong 
statement. It is actually much stronger than what we will need in the following, 
namely that d'^^{uj) is much larger than some low, fixed power of the dimension 
of the Hilbert space of the subsystem ds- 

2.3.2. Product states 



A particularly interesting class of initial states are product states. Theorem 2.3.1 
shows that almost all states chosen from sufficiently large subspaces have a high 
effective dimension. The set of product states however is not a subspace. 



The applicability of theorem 2.3. 1| to product states is therefore limited to 



the case where either the system or the bath states are fixed and the other is 
chosen from a subspace T-Lsfi or T-Lsfi of the Hilbert space of the bath or system 
respectively, such that = 1})^ ® 'Hb,r or Kr = 'Hs,r '^^q ■ 



Here we show that a slightly modified version of the first part of theorem 2.3.1 
holds for product states where both the system and the bath part are chosen from 
subspaces 'Hs,r and 'Hb,r respectively: 

Theorem 2.3.2. The average effective dimension with respect to a Hamiltonian 
with non-degenerate energy gaps (^d^^ (to)') i^^b where the average is computed 
over product states ipQ ® tp^ consisting of uniformly random pure initial states 
i/Jq^^ G Vi^Hs/Bfi) chosen from subspaces 'Hs/b,r ^ 'Hs/b of dimension ds/B,R 
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respectively is such that 



> 



{ds,R + 1) {dB,R + 1) 



(2.3.6) 



Proof. The proof uses some of the ideas from the proof of theorem 2 in [25]. The 
first step is to see that the average effective dimension is bounded by the inverse 
of the average purity of the time averaged state as follows. 



> 



1 



To bound the average purity we first use the simple identity 

Tr[AB] =Trp®5)§], 



(2.3.7) 



(2.3.^ 



where S is the swap operator of the two tensor components. Equation (2.3.8) can 



a basis. 




^{kl\{A®B)^kl) 
ki 


(2.3.9) 


Y^{kl\{A®B)\lk) 

kl 


(2.3.10) 


kl 


(2.3.11) 


^{k\AB\k) = Tt[AB] 


(2.3.12) 



Second, we need the following lemma, which follows from the representation the- 
ory of the unitary group: 

Lemma 2.3.1. [25j Let (•)^ be the average over random pure states ip G Vi{'Hr) 
drawn from some subspace 'Hr'^'H of dimension d^. Then 



TIrr (1 + §) 



(2.3.13) 



dR{dR+l)' 

where TIrr = Hr ® H/j and Ur is the projector onto the subspace Hr. 

Third, we need the assumption of non-degenerate energy gaps to identify the 
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time average with the dephasing map introduced in (2.1.3). In addition we need 



another hnear swap operator § that is defined via its action on product states, 

§\sbs'b') = \ss'bb') (2.3.14) 
where |s), |s') G Hs and \b), \b') G Hb- Note that § is unitary, §^ = 1 and 

||S||oo = 1. 

Writing \k) instead of \Ek) for the eigenstates to simphfy the notation, the 
average purity can be written as follows: 

<Tr[^']>,f^,„- = <Tr[$(^o" ® ^or]).,s^^B = (2.3.15) 
= (Tr[($ ® $)(^o^ ® ® i;'^ ® ^o^) ^)^s^^n (2.3.16) 

= (Tr[($®$)(§(V^o^®V^o^®V^o''®V^o'')S)S]) , ^ (2.3.17) 

= (Tr[{$0$){Si^'^0^'^0^l^®^^)S)S]) (2.3.18) 

= Tr[($ ® $)(§ ((^0^ ® ^0^)^, ® (^o"" ® ^o''>^b) S) §] (2.3.19) 
= Tr[($ ® $)(§ (n55,R (I55 + S55) ® ns5,R (Ibb + ^bb)) S) §] (2.3.20) 
= Yl Tr[|fc/)(fc/| § {Uss,R (tss + ^ss) ® T^bb,r (Ibb + Si^s)) S \kl){kl\ §] 

(2.3.21) 

= D-^ ^Tr[|A;/)(/fc|] (fc/|§(n 55,/? (I5S + ^Ss) ® nBB,K (Ibb + Sbb)) S I/c/) 

(2.3.22) 

= J2^kk\S iUss,R (tss + §55) ® nsB.K (Ibb + §bb)) § |A;A;) (2.3.23) 

k 

Thereby 1ss/bb and Sss/bb are the identity and the swap operator on the product 
spaces Tis/B ®'Hs/Bi the T^ss/bb,r are the projectors onto the symmetric product 
of subspaces 'Hs/b,r®'Hs/b,r. respectively and D = ds^R {ds,R + l) dB,R {dB,R+l)- 
If the restricted subspaces for both the system and the bath are taken to be 
the full Hilbert spaces the fact that ||S||oo = 1 and ||S5s/_bb||oo = 1 is sufficient to 
immediately see that 
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To proof the full lemma all that remains is to show that the sum in (2.3.23) 
is bounded by 4/(^5^^? rf^.j?)- The calculation is quite lengthy but most parts are 
straight forward, therefore we discuss it only briefly. 



The energy eigenstates can be written as linear combinations of product states 

\k) = J2^ksb\sb), (2.3.25) 

sb 

where the \s) and \b) form an orthonormal basis of the system and bath Hilbert 
space, which is chosen such that the first ds/B.R vectors span the restricted sub- 



spaces 'Hs/B,R- Expanding the middle part of (2.3.23) gives four terms. The two 
symmetric terms, the one without any S55 or Sbb, and the one with both §55 
and EiBB are both equal to 

<is,R d,B,R 

^2.3.26) 

k ss' bb' 

while the two asymmetric terms are equal to 

Xll] ^^^'^ '^ksh 4s'b'- (2.3.27) 

k ss' bb' 



Both contributions are real and (2.3.27) is always smaller or equal than (2.3.26). 



This can be seen by using the fundamental inequality 

ab* + ba* <\a\^ + \b\'^ (2.3.28) 
with a = CksbCks'b' and b = Cks'bCksb', which gives 

Cks'b Cksb' Clsb ^*ks'b' + Cfcsfe Cks'b' Cfcg/f, Cl^y ^ 
^ II l|2 II ||2 I II ||2 II ||2 \ • • I 
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Finally the first symmetric term can be bounded as follows: 



ss,R ® ^bb,r) ^kk) 

k 

= Y,{kk\{Us,R ® Ub,r ® Us,R ® UB,R)\kk) 

k 

= J2{kms,R ® UB,R\k){k\ns,R ® nB,R)\k) 

k 

<J2ik\(^s,R^T^B,R\k) = ds,R dB,R 



(2.3.30) 
(2.3.31) 
(2.3.32) 



This completes the proof. 



□ 



First note that if either ds = 1 and thereby Hr = ^/'q ® Hb,Ri or = 1 and 
thereby I-Lr = l-Ls r ®V'(f ? recover the result of theorem 2.3.1 The new version 



of theorem 2.3.1 is sightly better than the original one in situations where both 
the system and the bath state are drawn from subspaces of comparatively large 
dimension. 



2.3.3. States from ttie mean energy ensemble 



In theorem 2.3.1 the bound on the probability to get a state with a low effective 
dimension drops of exponentially. This raises the hope that the result does not 
depend on the details of the measure from which the states are drawn and that 
similar statements hold true for other non-singular measures. The Haar mea- 
sure and the unitary invariant ensemble used in both theorem |2.3.1| and |2.3.2| is 



sometimes criticized for being unphysical. It is therefore worth considering other 
more physically motivated ensembles. In this section we will calculate the average 
effective dimension in the mean energy ensemble 



12 



Without loss of generality we assumed that the Hamiltonian ^ of the sys- 
tem under consideration is positive, has non-degenerate energy gaps and that its 
eigenvalues are ordered such that Ek < Ek+i^k. The mean energy ensemble to 
energy E is defined as the set of normalized pure states ip with energy expectation 



^^The results presented in this section partially originate from a discussion with Markus Miiller 
and Jens Eisert in October 2009. 
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value E 

Me = {^: Tt[iIj] = 1, Tr[^ tP] = E}. (2.3.33) 

In [50| H9l |5^ l55j this ensemble was suggested as a natural alternative to the con- 
ventional definition of the microcanonical ensemble we discussed in section 12.2.11 
Note that the mean energy ensemble is invariant under shifting and rescaling 
of all involved energies, i.e. the energy E and all eigenvalues Ek of the respective 
Hamiltonian, whereas the Harmonic mean 

Eh = (2.3.34) 

is a nonlinear function of the E^ and is therefore not invariant. By appropriately 
shifting all energies Efj can be adjusted to all values between the ground state 
energy Eq and the mean energy E0 = TT[Jif]/d while at the same time keeping 
all energies positive [51j. It is therefore always possible to shift the energies such 
that E ^ Eh when Eq < E < E0. 



Using this trick it is shown in [5T] that the manifold of states defined (2.3.33) 
shows a strong concentration of measure phenomenon and a method to approxi- 
mately sample states from the mean energy ensemble is derived: 

Theorem 2.3.3. (Algorithm 21 in [56j) Consider the mean energy ensemble to 
energy E of a d-dimensional quantum system whose Hamiltonian with eigen- 
vectors \Ek) and eigenvalues E^ is assumed to be positive. If E < E0 = TT[Jif]/d 
so that all energies can be shifted such that E ^ Eh = d/ Ylk ^/-^k o-nd if E satis- 
fies some mild additional constraints (see the original paper /37]/ for more details) 
a state vector 

= J2^k\Ek) (2.3.35) 

k 

from the mean energy ensemble at energy E can be sampled to good approximation 
by choosing the real and imaginary parts of the expansion coefficients indepen- 
dently from normal distributions with variances 



.»=^J^. (2.3.36) 

// the spectrum of fulfills some additional constraints the described procedure 
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becomes exact in the thermodynamic limit (i — > oo (for details see JSTj). 

The sampling method is similar to the sampling procedure for the Haar measure 
ensemble (see appendix [b]). But the variances of the normal distributions from 
which the real and imaginary parts of the expansion coefficients are drawn are 
now functions of the energy of the respective eigenstate of the Hamiltonian. 

We are particularly interested in calculating the average effective dimension in 
the mean energy ensemble. Whether or not the average effective dimension is 
large depends on the structure of the energy spectrum and the energy E to which 
the mean energy ensemble is taken. 

If E is close to Eq, the shift to make E ^ Eh is such that after the shift Eq is 



close to zero. The variances (2.3.36) and the expected moduli of the expansion 
coefficients with respect to the energy eigenbasis are then very nonuniform. This 
in general leads to a small average effective dimension. In the extreme case that 
E = Eq the only state in Me is the ground state \Eo) and d^^{uj) = 1. If on the 
other hand E is close to E0 = TT[J^]/d, a large positive shift is necessary to 



make E ^ Eh- The variances ( 2.3.36[ ) and the expected moduli of the expansion 



coefficients are then relatively uniform. This results in a large average effective 
dimension. In the extreme case that the shift is much larger than the spread of 
the energy spectrum we recover the Haar measure ensemble for which we already 



know that the average effective dimension is large (theorem 2.3.1). The more 
uniform the shifted energies are, the higher is the average effective dimension. 

This is reflected in the following theorem, which establishes estimates for the 
average effective dimension in the mean energy ensemble: 

Theorem 2.3.4. Consider a d- dimensional quantum system whose Hamiltonian 
J^, with eigenvalues Ek > 0, has non- degenerate energy gaps. LetE0 = Ti[,^]/d 
and assume that E is such that theorem 2.3.3\ can be applied and that the ener- 



gies have be shifted such that E ^ Eh = d/'Y^^^ Then the average purity of 
the time averaged state {Tt[u)'^]) ^^^j^j^ , where the average is computed over pure 
initial states ipQ drawn from the mean energy ensemble at energy E, is to good 
approximation given by 
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and the average effective dimension (^d"^^ (u)'^ ^^^^^^ is to good approximation lower 
bounded by 



// the spectrum of Jif fulfills some additional constraints both statements become 
exact in the thermodynamic limit — > oo (for details see 151]). 

Proof. The bound on the average effective dimension in the mean energy ensemble 
follows from the estimate of the average purity of the time averaged state as 



As the Hamiltonian has non-degenerate energy gaps the average purity of the 
time averaged state is 

(Tr[-^]W = E(I^^O^o.M.- (2.3.40) 



According to theorem |2.3.3| we can sample from the mean energy ensemble to 
good approximation by choosing the real and imaginary parts and hk of the 
expansion coefficients = cia, + i hk from normal distributions with variances 
^fc = Therefore 




(I^^O^ogAfe ^ I o fwzr^ I / / («^ + &D'e ^-^e -ida,rf6, (2.3.41) 



2^.' = !^, (2-3.42) 



and we find that0 



To prove the second inequalities we use the fact that the harmonic mean is up- 
per bounded by the arithmetic mean, which follows from the generalized means 

""^■^Note that in the hmit d — >■ c», where the sampling procedure becomes exact, and if E and 
all the are identical we recover the first part of theorem |2.3.1| for — d. The unitary 
invariant ensemble for the full Hilbert space is a special case of the mean energy ensemble 
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inequality [57] : 

E^Eh = <^Y.^^ = ^0 (2.3.44) 



Using theorem 2.3.4 and that E ^ Eh we get 



k ^ k 



□ 



Concluding we can say that as long as E is comparatively high, such that after 
shifting the energy levels Eq is not too many orders of magnitude lower than 
E0 the average effective dimension will be highj^ The closer E is to the ground 
state energy, the smaller is the average effective dimension. This is not surprising. 
Lowering the energy we expect to observe a transition from thermodynamic to 
quantum behavior. This is precisely what happens, for high E we get a high 
effective dimension, which, as we will see later, causes thermodynamic behavior, 
while for lower and lower E the effective dimension will decrease making quantum 
effects observable. 



2.4. Equilibration 

One of the most obvious features of thermodynamic systems is the tendency to 
evolve towards equilibrium. It is therefore not surprising that the oldest and best 
understood part of Thermodynamics and Statistical Mechanics is concerned with 
systems in thermal equilibrium. The tendency to equilibrate is postulated in the 
Second Law of Thermodynamics. Starting from this postulate one can use ensem- 
ble theory or the condition of detailed balance to derive equilibrium properties of 
physical systems. How, and under which conditions, the microscopic, time rever- 
sal invariant dynamics of such systems leads to equilibration and thermalization 
remains unexplained. 

"^*Note that high will usually mean that d'^^^'^) is much larger than the dimension ds of some 



small subsystem (see section 2.4.2 2.6.1 and 2.7 1. Keeping in mind that d grows exponen- 
tially with the number of constituents of the system 2i?^/((ii?g) will be large compared to 
ds even if Eq is several orders of magnitude smaller than E0 . 
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In a time reversal invariant theory equilibration in the usual sense is impossi- 
ble!^ therefore use an extended notion of equilibration and say that a system 
is in equilibrium when its density matrix stays close to some state, for almost all 
times and say that it evolves towards equilibrium if it approaches such a state, 
and then stays close to it, when started in a state far from equilibrium. Likewise 
we will say that an observable gives the impression of equilibration when its mea- 
surement statistics is compatible with the assumption of an equilibrated system. 
This is the case if its expectation value and higher moments are nearly stationary 
for almost all times. 



2.4. 1 . Equilibration of expectation values 

Under which conditions observables can create the impression of equilibration was 
recently investigated by Peter Reimann in |2l]. The main result of this paper is 
the following, very useful theorem which we present here in the form given in |35| : 

Theorem 2.4.1. \24l I35| Let A be an observable and let pt evolve under a Hamil- 
tonian with non- degenerate energy gaps, then 

((Tr[Ap,]-Tr[Aa;])^X<M|, (2.4.1) 

where u = {pt)f 

A similar result is derived in [S]. 



Theorem 2.4.1 is a very remarkable result. Whenever the effective dimension 
d'^^{(jj) is large, the time average of the square deviation of the expectation value of 
any observable from its time average will be small. Therefore, systems which are 
in a state with a high effective dimension will look like they were in equilibrium 



most of the time although in reality they evolve unitarily. Theorem |2.4.1 shows 



that a time reversal invariant theory can create the impression of equilibration. 



It shall be stressed that theorem 2.4.1 is a statement about the dynamics of 



states with a high effective dimension. It is crucial to note, that theorem |2.4.1 it 



is a much stronger statement than the usual typicality arguments often made in 
Statistical Mechanics. Such arguments state that there is a large set of equilibrium 



^At least in finite dimensional systems 
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states and that one can therefore expect that starting in a non-equihbrium state 
not in this set, one will sooner or later end up in an equilibrium state. In contrast, 
theorem |2.4.1| implies that initial states which are out of equilibrium, i.e states 
for which Tr[Apo] is far from Tr[y4a;] definitely will equilibrate whenever d'^^{uj) 
is large. It does however not make an assertion about how long it takes to reach 



equilibrium. We will come back to this problem in section 2.6.3 



2.4.2. Subsystem equilibration 

An even stronger result can be obtained for subsystems of large quantum mechan- 
ical systems. Very recently it has been shown in |25] that the dynamics of almost 
every large quantum system is such that for almost every pure initial state every 
small subsystem equilibrates. The main result of a recent work of Noah Linden 
et al. |25| is a rigorous bound on the expectation value of the trace distance of 
the reduced state of the subsystems from its time average in terms of the effective 
dimension d^^{uj): 

Theorem 2.4.2. (Theorem 1 in [25j ) Consider any pure state ipt evolving 
under a Hamiltonian with non- degenerate energy gaps. Then the average distance 
between pf = Tisi't o.nd its time average cu^ = (pf)^ is bounded by 



d 




(D(pf..'')>,<-^^<-W^ (2,4,2) 



Again it is of utter importance to understand that theorem 2.4.2 is a statement 
about the dynamics of states with a high effective dimension and therefore much 
stronger than a typicality argument. It implies that initial states which are out 
of equilibrium, i.e states for which is far from definitely will equilibrate 
towards uo^ whenever d^^{(jj) is large. Again it is difficult to make assertions 



about the time scales on which equilibration happens (see section 2.6.3). 

Of course this theorem only makes sense as long as the Hilbert spaces in- 
volved are finite dimensional. Only then can their dimension serve as a sensible 
measure for smallness of the subsystem. However, in |30l EH [29] it has been 
demonstrated that small subsystems of quantum systems with infinite dimen- 
sional Hilbert spaces also tend to equilibrate. The example considered in [30l [3T] 
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t 



Figure 2.1.. Plot of how the time evolution of the trace distance can be imagined. 

Starting in a state far from equilibrium the subsystem will evolve to- 



wards states close to the equilibrium state (see section 2.6.3 for more 
information about the time scales on which equilibration happens.). It 
will then stay close to the equilibrium state for almost all times. Oc- 
casionally fluctuations will drive it out of equilibrium but these events 
are extremely rare. After an extremely long time the system recurs to 
its initial state but the time scale on which this happens is enormously 
large for macroscopic systems. 



is a bosonic chain with quadratic couphng and in [29] a system consisting of 
oscillators coupled with a harmonic interaction Hamiltonian is investigated. In 
both works the measure of smallness of the subsystem is the number of units 
that constitute the subsystem. It is shown that all small subsystems equilibrate 
for squeezed pure initial product states while the whole system undergoes a uni- 
tary time evolution. Further numerical studies that confirm the analytical results 
presented above can be found for example in |59| [8]. 



2.4.3. Equilibration of tlie purity 

To further illustrate the phenomenon of subsystem equilibration we look at the 
purity of the subsystem state p(pf) = TrKpf)"^]. A necessary, though not suffi- 
cient, condition for equilibration is that the time average of the purity (p(pf))^ 
and the purity of the time averaged state p{u)^) are almost identical 

{pipf))^^piu'). (2.4.3) 

Their distance can be bounded as follows: 

Theorem 2.4.3. In a system evolving under a Hamiltonian with non-degenerate 
energy gaps the difference of the time average of the purity {|p(pf))^ and the purity 



35 



2.4, Equilibration 



of the time averaged state p{u}^) is bounded by 



ds + 2 
~d^^ 



(2.4.4) 



Proof. Expanding the initial state in the energy eigenbasis we can write the purity 

as 

p(pf) = CkcUmC: e-'(^'=-^'+^'"-^")*Tr5[TrB \Ek){Ei\ Ttb \E^){En\]. 

klmn 

(2.4.5) 

The sum over /c, /, m, n can be spht up into four parts as follows: 

E-"= E E E E 

klmn k^l,m^n k^l,m=n k=l,mj^n k=l,m=n 

The second and third sum contain only oscillating terms for which the time av- 
erage {■ ■ ■)i vanishes. The time average of the first sum is positive and equal to 



|cfcp |cmp TrB[Trs \Ek){Ek\ Tts \Em){Em\] 
TrB[u;^'] - J2 \ck\'TTs[iTTB \Ek){Ek\)% 



k,m 



(2.4.7) 



and the fourth sum contains only terms that are time independent and is thus 
equal to 



\ck? \cm? Tr^iTrB \Ek){Ek\ Ttb \E^){E^\] 



k,m 



(2.4.8) 



In the derivation of both (2.4.7) and (2.4.8) we have used equation (A5) from 



appendix A in [25J. The sums in the right hand side of (2.4.7) and (2.4.8) are 
both bounded by Tr[a;^] so that we find: 



\{p{pf))^-p{u')\<TTs[uj^'] + 2TT[u' 



(2.4.9) 
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Using the fact that 



gives the second bound. □ 



As we would have aheady expected from theorem 2.4.1 and 2.4.2 we find that 
the purity is close to a typical value, namely Trs'[(X''^^], most of the time whenever 
d'^^ is large. 

2.5. Ergodicity 

The question whether physical systems are (quasi) ergodic plays a central role 
in all Gibbs like attempts to justify the methods of Statistical Mechanics from 
Newtonian Mechanics [2]. Ergodicity is either used directly to identify time and 
ensemble averages, or as a way to justify the choice of a priory probabilities and 
the microcanonical ensemble. The question whether all, and if this is not true 
than which thermodynamic systems are quasi ergodic was investigated by many 
authors. In the classical setup the problem was reduced to the problem of showing 
metrical transitivity, and quasi ergodicity is proven for so called Kanonische Nor- 
malsysteme. Irrespective of these efforts the problem still awaits a full solution, 
so that quasi ergodicity is rather a hypothesis than anything close to a stable 
foundation for a physical theory. 

The approach towards the foundations of Quantum Mechanics we follow herein 
does not depend on quasi ergodicity. Nevertheless, due to its historical impor- 
tance, ergodicity is a property that deserves investigation in its own right. It 
turns out that the approach based on measure concentration techniques can be 
used to prove ergodicity: 

Theorem 2.5.1. Let (^Ti be a subspace of dimension dfi corresponding to a 
microcanonical constraint. The probability that the time average (Tr[i? T/^f])^ of the 
expectation value of an arbitrary observable B computed for a randomly chosen 
pure initial state tpQ G Vi{'Hr) differs from its microcanonical expectation value 
with respect to Hr is exponentially small in the sense that for every e > 



Pr{|(Tr[B^t])^-(5)^J >e} <2e (2.5.1) 
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where C is a constant with C = (36 tt^) ^ and 

$[B] = J2 \Ek){Ek\B\Ek){Ek\. (2.5.2) 



Proof. The proof is completely analogous to the proof of theorem 2.2.2 and relies 



on Levy's lemma. For an arbitrary fixed observable B we define the function 

fsM = (Tr[5V^t]), = TT[B$[i;o]]. (2.5.3) 

As corresponds to a microcanonical constraint the projector n^j on com- 
mutes with the Hamiltonian [Ilji,Jif] = 0. Thus the expectation value {fBiip))^^ 
of this function with respect to a randomly chosen pure initial states ipQ G Vi{'Hr) 
is 

(/i?(^o))^„ = (Tr[5 $[^o]])^, = Tr[fi $[(V^o)^J] = {B)^, . (2.5.4) 

Its Lipschitz constant rj with respect to the Hilbert space norm is upper bounded 
by 2||$[5]|U, as tl5|: 



Ifsi^i) - /b(^2)| = I Tr[5$[^i - ^2]]| = I TT[$[B]{iJ^ - ^2)]! (2.5.5) 

= lliiH + (H) m (I^Ai) - 1^2)) + {{H - (H) HB] + m)\ (2.5.6) 

< ||$[i?]||oo 111^1) + 1^2)112111^1) -1^2)112 (2.5.7) 

< 2 1^2)112 (2.5.8) 



Applying Levy's lemma (s. appendix [C|) to fsi'^Po) gives the desired result. □ 
Note that for time independent observables [J^,B] = 0, and consequently 



$[B] = B so that we recover theorem 2.2.2 If an observable is time dependent 



then II $[5] II 00 < 1 1 1 1 00 and the bound on the deviation is tighter than the bound 
of theorem [2X2I 



2.6. Dynamics of the state of ttie subsystem 

In the previous sections we have shown under which conditions expectation values 
and reduced subsystems of large quantum mechanical system equilibrate. We have 
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however left out a crucial point, namely the timescales on which equilibration 
happens. This section will be concerned with the dynamical properties of the 
state of the subsystem. 



2.6.1. Speed of fluctuations around equilibrium 

Knowing under which conditions the state of the subsystem equilibrates a natural 
question is: How fast will the fluctuations around the equilibrium state typically 
be? This question was investigated very recently by Noah Linden et al. |35] . 

The first step is to introduce a physically meaningful notion of speed. This is 
achieved by setting 



vit) 



lim 

st^o 



■St) 



St 



dpt 



dt 



where according to the von Neumann equation 

dpt 



dt 



i[pt,^]- 



Equivalently one defines the speed of the state of the subsystem as 



lim 

(St-j.0 



T^iPt^Pt+5t 

6t 



dpt 



dt 



with 



dpi 
dt 



i TTB[pt,J^]. 



(2.6.1) 



(2.6.2) 



(2.6.3) 



(2.6.4) 



As the choice of the origin of the energy scale does not influence the speed it is 
convenient to split up the Hamiltonian of the system as follows 



^ = ^0 + =^5 ®1 + 1 ® -^B + ^SB ■ 



(2.6.5) 



Thereby J^o is taken to be proportional to the identity and J^s, b and ^ sb 
are tracelessF^ 

Using this it is shown in [35] that: 



^^Note that this decomposition is not unique and the freedom can be used to optimize the 
quantities appearing in the following theorems. 
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Theorem 2.6.1. |35j For every pure initial state ipQ of a composite system evolv- 



ing under a Hamiltonian of the form of (2.6.5) and with non- degenerate energy 
gaps, it holds that 



dpt 



dt 



dl 



d-^{io) 



where pf = Tr^ ipt o-nd u = {ipt)t 



(2.6.6) 



Proof. We will only give a short sketch of the proof herein, for the full proof see 
the original article The first step is to show that the speed of the subsystem 
state can be written as 



dt 



fc=i 



such that 



dpf 



dt 



(2.6.7) 



(2.6.8) 



where the (i| operators Ck form an orthonormal basis for the set of hermitian 
operators on the Hilbert space of the subsystem such that Tr[efcei] = Sm and 



Ck{t) = Tr [pt i [M's + ^SB. Ck ® 1]] . 



(2.6.9) 



One then applies theorem 2.4.1 from j24j to A = i \^ s + sb-, ® 1] to bound 
the expectation value of the squared coefficients ((cfc(t))^)j. The final step is to 



use a standard bound connecting the Hilbert-Schmidt norm used in (2.6.8) with 
the trace norm. □ 



From theorem 2.3. 1[ |2.3.2| and the discussion in section 2^ we know that the 
effective dimensions d'^^{u) typically is very large in realistic situations. In par- 
ticular, as all dimensions grow exponentially with the number of constituents of 
the system it will usually be much larger than any fixed power of ds- There- 
fore, the speed of the subsystem will, most of the time, be much smaller than 
II J^s ®1 + SB lloo; which is the natural unit in which the speed of is to be 
measured 1351. 
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2.6.2. Fluctuations of thie purity of thie reduced state 



The bound on the speed of the state of the subsystem we have discussed in the last 
section depends on the interaction Hamihonian M' sb and the local Hamiltonian 
M' s- We are however primarily interested in understanding how the interaction 
with the environment leads to equilibration. It is therefore instructive to consider 
a quantity that does not feel the local dynamics of the subsystem and instead is 
a good measure for the correlations with the environment. Such a quantity is the 
purity of the subsystem 



p(pf ) = Tr[p: 



(2.6.10) 



We can establish a bound on the time average of the rate of change of the purity 

vi - Pt+st 



hm 



(2.6.11) 



dt st^o 6t 

that depends only on the strength of the interaction Hamiltonian: 

Theorem 2.6.2. For every initial pure state ipQ of a composite system evolving 



under a Hamiltonian of the form of (2.6.5) and with non- degenerate energy gaps, 
it holds that: 



dpi 




dpf 


dt 




dt 



(2.6.12) 



where u = {ipt)f 

Proof. The first equality is trivial as the purity of the system and the purity 
of the bath are always identical if ip is pure. This follows from the Schmidt 
decomposition of the pure state [62] . 
Note that 



Tr[p2] _ Tr[(T2] 



Tr[p2_a2j| < Tr|p2-a^| 
p + a 



2Tr 



< 2 max TrM (p-(t)1 

~ 0<A<1 ^ ^ 

= 2V{p,a). 



(2.6.13) 
(2.6.14) 
(2.6.15) 
(2.6.16) 
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We can therefore bound the average rate of change of the purity by 



dPt 



dt 



hm 



\Pt - Ph 



St\ 



< 2 hm 
<2vs{t). 



6t 



St) 



6t 



(2.6.17) 

(2.6.18) 
(2.6.19) 



Inserting this into theorem 2.6.1 gives 



dpf 




dt 





dpf 



dt 



<2\\,^S®t+'^SB 



4 



d^^{u) 



(2.6.20) 



Noting that J^s does not influence the rate of change of the purity one obtains 
the desired result. □ 

The above theorem tells us that the average rate of change of the purity is 
small, in addition we can establish bounds telling us when the rate of change of 



the purity must be small during a particular evolution. Form (2.6.7) and (2.6.9) 
we see that dp^ /dt depends on JifB only implicitly through the trajectory pt. We 
therefore have: 

Pf+5t = Pf + St (i [pf , J^s] + i [Pu ^sb\) + 0{be) (2.6.21) 



Inserting this into (2.6.11) gives 



dt 



Tr[pf2iTrB[A,^5B]] 
Tr[pt {2iTYB\pu^sB\ 



!)]• 



(2.6.22) 
(2.6.23) 



The operator 2 i TrB[pf, sb\ is hermitian and traceless. The more mixed pf is 
the more likely will it have overlap with both the eigenstates with positive and 
negative eigenvalues of this operator. Therefore, the more mixed the subsystem 
is, the slower is the rate of change of its purity. Rates near the maximal rate of 
change of 2 || TrB[pt, ^sbIHoo < 2 || sb ||oo can only occur if pt is such that the 
subsystem state pf is relatively pure. On the other hand if pt is a pure product 
state pf = and consequently dpf/dt = as pf is positive and differentiable. 
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Obviously, too little entanglement also leads to a slow rate of change of the purity. 

The consequence of the interplay of these two counter acting influences is the 
subject of the following theorem: 

Theorem 2.6.3. For every initial state po of a composite system evolving under 



a Hamiltonian of the form of (2.6.5) and with non- degenerate energy gaps the 



absolute value of the rate of change of the purity is upper bounded by 



dpf 



dt 



< 2 Hp? II oo y'^IsBiPt) II =^ SB 

<2pf ^2 IsB{pt)\\^ SB Wo., 



(2.6.24) 
(2.6.25) 



and if pt is pure this implies that 



dpi 



dt 



<ApU S{pI)\\^sb\U 



(2.6.26) 



where pl"'^ = Pt — pI ® pf , S{pf) is the von Neumann entropy of the reduced state 
and IsBi.Pt) is the quantum mutual information between the subsystem and the 
bath. 



Proof. In [UU] it is shown that instead of (2.6.22) we may write the rate of change 
of the purity as 

(2.6.27) 



^ = Tr [pf 2 iTrs [pr,^5B] 



where p^°^ = pt — p^ ® pf is the correlation operator [60j which satisfies 



\pT 



li < \/2IsB{pt) 



(2.6.28) 



and IsB{pt) is the quantum mutual information. Using this one can easily see 
that 



dpi 



dt 



<2||pfiuii[pr,=^5B]iii 

— 2 ||pf lloo II SB lloo ||Pt°''lll 
II SB II oo 



2IsB{Pt 



< 2pl y/2IsB{pt) W^SB 



(2.6.29) 

(2.6.30) 
(2.6.31) 
(2.6.32) 
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where in the last inequahty we have used the fact that ||pf ||oo < II pf II 2 = Pt ■ For 
pure states pt the quantum mutual information reduces to 

IsB{pt) = S{pf) + S{pf) - S{pt) = 2 S{pf). (2.6.33) 

□ 



2.6.3. Equilibration time 

Only if equilibration happens on reasonable time scales the mechanism of equili- 
bration presented herein gives a satisfactory explanation for the irreversible be- 
havior of our every days world. 

It is obvious that when the Hamiltonian is multiplied by a positive constant 
factor the dynamics of the system speeds up or slows down by exactly this factor. 
Similarly one might expect that a rescaling of the interaction part of the Hamil- 
tonian will increase or decrease the equilibration time. But, other properties, like 
the relative orientation of the eigenbasis of the subsystem Hamiltonian and that 
of the interaction Hamiltonian, its interaction range, the spacial extend of the 
system, as well as the fine structure of the spectrum also have crucial influence on 
these timescales. Due to the generality of the approach pursued herein we cannot 
say much about the timescales on which equilibration happens, although we will 
give some bounds below. This has provoked well justified criticism |18] . 

However, In specific models it is possible to calculate equilibration times explic- 
itly and it turns out that they have reasonable values. This was shown in some 
analytical works |2H1 EHl EI] as well as in numerical studies [2Z1 1121 EHl SSI EE E]- 

Some simple estimates can be made even without specifying a model in detail: 
Assume that the initial state ipo was drawn from the subspace of energy eigenstates 
with eigenvalues in the interval [E, E + AE]. Then 

^^(t) = lim ^^P'f'+st) (2.6.34) 



1 


dpt 


1 

< - 


dpt 


2 


dt 


1-2 


dt 



(2.6.35) 



= -\[yr,pt]\,<AE (2.6.36) 
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If the system starts in a non-equilibrium state the initial distance "D^Pq, co^) from 
the time averaged, equilibrium state can be expected to be of order 1. So that 
even if the subsystem state immediately starts to head towards u with maximal 
speed it will take at least a time span of the order of magnitude of the Heisenberg 
time 

T ^ ^ (2.6.37) 



AE 

until the equilibrium state is reached. That equilibration can indeed happen on 
timescales that are roughly of the order of magnitude of T can be seen from the 
numerical simulations presented in [27j. 

Another non trivial bound on the equilibration time can be obtained from 



theorem 2.6.3 For pure joint system states we found the following bound on the 
rate of change of the purity: 



dPt 



dt 



< 4pf VMrfs) ||^5ij|U 



(2.6.38) 
(2.6.39) 



Now, assume that the initial state is a pure product state so that = 1 and that 
the equilibrium state has purity . By integrating the differential equation for 



the purity (2.6.38) one finds that the time T to reach the equilibrium purity p; 
is bounded by 



T > 



log(^) 

f eq 



(2.6.40) 



4v/log(d5)||^5B||oo 

Equilibration in a shorter time is impossible, even if the evolution is such that 
the purity decreases with the maximal possible rate. If the equilibrium state is 
the maximally mixed state pfq = \jds the minimum time until equilibration is 



T > 



Vlog(c^6 

4 II =^55 I 



(2.6.41) 



We find that equilibration to a state with fixed purity can happen the faster 
the larger the system is. The minimal time decreases like l/y^rf^. In contrast, 
equilibration to the completely mixed state takes longer the larger the system is. 
Here the minimal time increases with ^J~ds. In both cases most time is spend 
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during the final approach, as according to (2.6.38) the rate of change gets slower 
the lower the purity is. The more relevant time scale, even for equilibration 



towards the completely mixed state therefore is (2.6.40), as there will be some 
value of the purity, independent of ds, from which on the state will be practically 
indistinguishable from the completely mixed state. 



2.7. Equilibration and einseiection 

The term einseiection, which stands for environment-induced super selection, is 
due to Zurek p5| E]- Einseiection is known to occur in situations where the 
Hamiltonian of the composite system leaves a certain orthonormal basis of the 
subsystem, spanned by so called pointer states \p), invariant If this is the 
case, the Hamiltonian and the time evolution operator have the form 

^ = ^|p)(p|®^(p) (2.7.1) 
p 

Ut = J2\P)(P\®Ut'\ (2.7.2) 
p 

where ui""^ = e-' -"^^"' * and the ^^^^ are arbitrary hermitian matrices. One finds 
that the subsystem state of an initial product state of the form Po = Po ® "^(f ' 
where the state of the bath can be assumed to be pure without loss of generality, 
evolves into 

pf = E \p){p\Po\pW\ MUI''^' Ui'^l^^) (2.7.3) 
pp' 

Under the evolution induced by such a Hamiltonian the diagonal entries of Pq, 
when expressed in the pointer basis, remain unchanged while the off-diagonal 
entries are suppressed by a factor of {ip^lu'f '' ul^^ip^) < 1. The actual time 
development of the {ipoWt^^ u\:^\iPq) depends on the explicit model under con- 
sideration, but for many models they have been found to decrease rapidly over 
short time scales jZl [5l |9l |6l [63]. If some of the J^^"^"* lead to an identical time 
development for the chosen initial bath state there exist subspaces of Us in which 
coherence is preserved and in which quantum mechanical superpositions survive 
the interaction with the environment. 
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Note that, the diagonal entries, which survive the decoherence, are completely 
determined by and do not depend on the initial state of the bath tpQ at all. 
The direct opposite situation is the thermodynamic case where the final state is 
completely determined by the properties of the bath. Most realistic situations 
surely lie between these two extremes. 



Using the results discussed in section 2.6.1 it is possible to get rid of the quite 



limiting assumption on the form of the Hamiltonian and to shown that einselection 
is a more general phenomenon. Besides the usual assumption of non-degenerate 
energy gaps and d-B ^ ds, to ensure a large average effective dimension for almost 
all pure initial states, we only need to assume that the interaction Hamiltonian 
^ SB is weak. Note however, that in this more general setting, we can currently 



not say much about time scale on which the decoherence happens (see section 2.6.3 
for more details). The following discussion was earlier published by the author in 
[64J. 



We have seen in section 2.6.1 that for a system with a Hamiltonian of the form 



^ = ^0 + ^5®l + 1 ®=^B + ^5S, (2.7.4) 

where is proportional to the identity and b and ^ sb are traceless, 

the velocity of the subsystem state pf is 

^ = f;c,(t)e. (2.7.5) 

fc=i 

where the (i| operators form a hermitian orthonormal basis for 1-is such that 
TrfefcCi] = 6u and 

Cfc(t) = Tr {pit) i \.^s ®1 + ^SB, efc ® 1]] . (2.7.6) 

The velocity depends on b only implicitly through the trajectory pf, but for 
an arbitrary fixed state p the velocity is solely determined by ^ s and J^sb'- 



dp' 



s 



i[p'',jrs]+iTTB[p,^SB] (2.7.7) 



dt 



Now if J^fsB is much weaker than J^s, (2.7.7) is dominated by the first term. 
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Consequently, the system can only become slow when [p^ , J^s] is small. 

To see when this happens we first establish a general lower bound on the norm 
of commutators between states and arbitrary hermitian matrices: 

Lemma 2.7.1. Let p be a normalized state and A a hermitian observable with 
eigenvalues ak and eigenvectors \ak), then 

II [p, A] 111 = ||i[p,A]||i > 2 max Vlafc-a^l Ip^^l (2.7.8) 

> 2max |afc - a;| |pfci|. (2.7.9) 



where the maximization in (2.7.8) is performed over all decompositions of the 
index set {!,..., ds} into non-overlapping pairs {k, I) over which the sum is per- 
formed and pki = {ak\p\ai). 

Proof. The equality is trivial. For all traceless, hermitian, bounded operators B 
on some finite dimensional Hilbert space H it holds that [62j 

||fi||i=2 max Tr[n5l, (2.7.10) 

nev{n) 

where V{'H) is the set of all projectors on T-i and the maximum is obtained when 11 
is the projector onto the positive subspace of B. By expanding p in the eigenbasis 
of y4, using the above equality for B = [p, A] and considering all sums of mutually 
orthogonal rank one projectors Um of the form 

Hfcz = \nki){nki\ M = -^iWk) + e'^^' |a;)), (2.7.11) 



where (pki are phase factors, one easily verifies (2.7.8). The second inequality is 



trivial. □ 

Using the above lemma we can now proof the following theorem: 
Theorem 2.7.1. Consider a physical system evolving under a Hamiltonian of the 



form given in (2.6.5) and with non- degenerate energy gaps. All reduced states p^ 
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satisfy 



SB oo 



dp' 



dt 



"'''^^ ik,i) 

>max|i?f-i?f||pf,|, 



(2.7.12) 
(2.7.13) 



where pfi = {E^\p^\E^) and and \E^) are the eigenvalues and eigenstates of 
s. 



Proof. Using the inverse triangle inequality and (2.7.7) we see that 



i[p^^5]||l-||iTrB[p,^SB]||l|< 



dt 



(2.7.14) 



For \\dp^/dt\\i to become small the norms of the two commutators must be ap- 



proximately equal. Applying lemma |2.7.1| to the norm of the first commutator 
yields: 

||i[p^^5]||i > 2 max|Ef -Efl \pl\ (2.7.15) 

The norm of the second commutator can be upper bounded, using the well-known 
fact that the trace norm of traceless, hermitian matrices is non-increasing under 
completely positive, hermitian, trace-non-increasing maps |62) as follows: 



i Tib[P,^Sb\\\i < \\[P,^Sb\\\i <n^SB Woo 



This completes the proof. 



(2.7.16) 



□ 



The assertion of theorem 2.7.1 is almost intuitively clear, but combined with 



theorem 2.6.1 it allows to draw the following powerful conclusion: Whenever 
d'^^iu) is large the subsystem is slow most of the time and if this is the case 
coherent superpositions of eigenstates of ^ s with eigenvalue differences that are 
much larger than || ,^sb ||oo niay not contribute significantly to the state of the 
subsystem. That is, the corresponding off-diagonal elements of the reduced state 
pf in the eigenbasis must be small. A similar behavior was observed for a 
specific model in |65) . Without using any approximations we have shown that 
coherence can only be retained between eigenstates of ^ s whose energies differ 
by less than || M' sb ||oo- This statement remains meaningful even when the sub- 
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(c) (d) 

Figure 2.2.. Imagine the small subsystem is a single spin 1/2 particle and has 
eigenvectors |0) and |1). If no interaction with the bath were present 
the state of the subsystem would rotate around the z-axis with a con- 
stant speed (a). The interaction Hamiltonian J^sB gives an additional 
contribution to the velocity of the reduced state. As the interaction is 
assumed to be weak it can in general not significantly slow down the 
rotation of the reduced state due to the local Hamiltonian, even if it its 
contribution to the velocity points in the directly opposite direction (b). 
Only if pf is close to an eigenstate of J^s^ such that [p^ ^J^s] becomes 
small, the speed of the reduced state can become slow (c). The interplay 
of the two Hamiltonians defines a set of slow states (green) (d). As we 



know from theorem |2.6.1 that the speed of the subsystem state is almost 
always slow must spend most of the time in this allowed region. 
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system is large and its energy spectrum thus very dense. Theorem 2.7.1 then still 
implies that coherent superpositions of eigenstates with far apart energies (some- 
times called Schodinger cat states) must decohere. If the subsystem is small and 
the interaction Hamiltonian weak compared to the energy gaps of the subsystem 
Hamiltonian it implies an even stronger statement. The state of the subsystem 
must then, most of the time, be approximately diagonal in the eigenbasis of ^^ s- 
The consequences of theorem |2.7.1 are twofold: 

1) It proofs the existence of a natural decoherence mechanism in weakly in- 
teracting systems that causes decoherence in the local energy eigenbasis. This 
effect is indeed observed in many situations where the local Hamiltonian is much 
stronger than the interaction. A well-known example are electronic excitations of 
gases at moderate temperature. The energy gaps between the ground state and 
the first few excited states are typically much larger than the thermal energy. The 
dynamics of such systems is successfully described using transition rates between 
energy eigenstates. Ultimately theorem 2. 7.1 [ explains why this is eligible. 

2) It can be seen as an intermediate step of proving relaxation to the Gibbs 
state. It goes beyond the results of |25] and sheds some light on the roll of the 
weakness of the interaction in thermalization. This is interesting as the deriva- 
tions of the canonical state given so fare [T3| [HH [T^ [T71 [T5] either need to make 
very special assumptions, or at least partly depend on heuristic arguments. See 
section 2.2.2| and 2.8. 1| for a more detailed analysis of these results and a more 
elaborate explanation of this criticism. 



2.8. Initial state independence and ttie 
Second Law 

The final state of a small subsystem coupled to a large heat bath is typically 
independent of its initial state and is completely determined by some macroscopic 
properties of the bath. This key feature of thermodynamic systems is expressed 
in the Second Law of Thermodynamics. The obvious example is the canonical 
Boltzmann state which is completely specified by giving the local Hamiltonian 
and the temperature of the bath. One would therefore like to know under which 
conditions the equilibrium state of the subsystem of a large quantum mechanical 
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system is independent from the initial state and depends only on the subspace 
from which the bath state was drawn. 

Showing initial state independence is a prerequisite for thermalization and an 
important step towards a derivation of a Quantum Mechanical replacement of the 
Second Law. Unfortunately, it seems to be a quite difficult task to identify reason- 
able conditions under which initial state independence holds and there certainly 
are situations where it is violated. 



2.8.1. Conditions on tlie IHamiltonian 

From the form of the time averaged, equilibrium state it is obvious that the 
Hamiltonian, and in particular the form of the marginals of the populated energy 
eigenstates, is decisive for whether the equilibrium state depends on the initial 
state of the subsystem or not. 

In theorem 3 in [25] the following bound on the trace distance of the time 
averaged system state = {TTB[4't])t^ that belongs to a random initial pure 
state pQ G 'Pi{'Hr) chosen from a subspace I-Lb. C = T-Ls^T^b-, form the 
reduced microcanonical state p^^, = TrB[pmc] is established: 




(2.8.1) 



Thereby (■)^^ is the average over random pure initial states, 

^ = y,{Er,\^\Ek)i:Ts[{'^rB\Ek){Ek\f] < 1, (2.8.2) 

and Ufi is the projector onto Hr. This theorem is used to argue that if the state 
of the bath is fixed, i.e. "H/j = 'Hs ®\^)b and thereby = ds, the time averaged 
state of the subsystem is independent of its initial state if the energy eigenstate 
are highly entangled, i.e if 

TTB\Ek){Ek\^ ^tdsxds- (2.8.3) 
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However, even under this assumption one has: 

k ^ 

dn ds ^ ds 
Consequently the best estimate one can gain by applying the above bound is: 

This result is quite counter intuitive. One would expect that the time averaged 
equilibrium state us depends less on the initial state of the subsystem the smaller 
it is compared to the bath. In contrast to this the above bound gets tighter the 
larger ds is and does not depend on ds at all. One can therefore anticipate that 
the above bound is not tight for small subsystems and large baths. 

And indeed, if all energy eigenstates in some subspace have similar marginals, 
and in particular, if they are all highly entangled, it is possible to show initial 
state independence whenever ds <S d^^{u): 



(2.8.4) 
(2.8.5) 



Theorem 2.8.1. Let all energy eigenstates that span some subspace Hr ^ H of 
the total systems Hilbert space have similar marginals in the sense that 

max V{TrB\Ek){Ek\,TrB\Ei){Ei\)^5, (2.8.7) 

\Ek),\Ei)eHR 

or be highly entangled in the sense that 

max 2V{TrB\Ek){Ek\,^)^S. (2.8.8) 

\Ek)&nR ^ ' " ' ds 

Then the time averaged distance of the marginals pf and erf of any two pure initial 
states from I-Lr evolving under a Hamiltonian with non- degenerate energy gaps is 
upper bounded by 
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Proof. Using the triangle inequality twice we see that 
{-Dipf, af )>, < (I)(pf , (pf > + (P(af , (af > + P((pf >^ , (af > J. (2.8.10) 

The terms (T){pf , (pf )^))^ and (T>(a^ , (erf )^))^ can both be bounded using the- 
orem 12.4.21 and 

T^i{pf),,{^f),)<S. (2.8.11) 

□ 



On the first sight theorem 2.8.1 seems to be a quite nice result. The assumption 
of similar reduced states allows for some dependence of the final state of the 
subsystem on the macroscopic features of the bath. For example, assume that 
the marginals of the energy eigenstates are all close to the canonical state for 
the temperature associated with their respective energy. All marginals of the 
eigenstates that belong to some energy interval would then be close to the correct 
canonical state and we would recover the situation of equilibration towards a 
Boltzmann distribution known from classical Statistical Mechanics. 

Unfortunately, if the macroscopic properties are not primarily determined by 
the subsystem, and this is exactly the situation we are interested in, it seems to be 
unreasonable to assume that most energy eigenstates from some energy subspace 
have similar marginals. This assumption is called the eigenstate thermalization 
hypothesis and was first suggested in |22] (see also [42j) to explain thermalization. 
Although it was shown in [42j that the expectation values in the energy eigen- 
states of some reduced observables of an example system are close to a continuous 
function of energy the author does not find this assumption very convincing. In 
the very weak coupling limit, where we would like to recover the Boltzmann dis- 
tribution, the energy eigenstates are usually assumed to be close to product [IT] 
and their marginals thus are not at all similar. Changing the subsystem part of 
the initial state will have a significant impact on which eigenstates have a non 
vanishing overlap |cfcp = \{Ek\ipo)\^ with the initial state. Obviously only those 
l-Efc) whose system marginal are similar to the system part of the initial state, i.e. 
the states with TiB^Ek){Ek\\ ~ ipQ, will be populated. Thus, the initial state of 
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the system will have a non negligible impact on how the equilibrium state 

[pf), = Y.\^k\'TTB[\Ek){E,\], (2.8.12) 



will look like if the energy eigenstates \Ek) are close to product. This is true even 
for initial states with a high average effective dimension. Thus we can get equili- 
bration without initial state independence and thus without thermalizationj^ 

What do we learn from that? To have a chance of proving initial state inde- 
pendence, one at least needs some entanglement in the energy eigenstates and 
therefore a coupling Hamiltonian J^sb which is in strength at least comparable 
with the energy gaps of the non-interacting Hamiltonian J^s ®1 + 1 ® J^b- 



This condition is not to be confused with the assumption on the gaps of Jif 



s 



we have worked with in section 2.7 It is well possible that || J^sb ||oo is small 
compared to the gaps A{J^s) of the subsystem Hamiltonian while at the same 
time large compared to the gaps A{J^s^^ -l- 1 ® J^b) of the non-interacting 
part of the Hamiltonian: 

A(^5 ®1 + 1 ® ^b) < II ^SB Hoc < A(^5) (2.8.13) 

This can be expected to be the natural situation in thermodynamically large 
systems, as the density of energy states typically increases exponentially with the 
size of the system. 

If II J^sB Hoc ^ A.{J^s ®1 + 1 (S) ^b) the equilibrium state can be expected 
to not be robust against unitary transformations of the subsystem part of the 
initial state. Works claiming to derive the canonical ensemble under this as- 
sumption, or equivalently under the assumption that the energy eigenstates are 
close to product, and works that do not explicitly exclude the case || JifsB ||oo ^ 
A[j^s ®1 + 1 ® J^b) should therefore be considered with a healthy amount of 
mistrust (compare jTTl EZl [IS])- 



A work that will elaborate more on this point is currently in preparation. 
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2.8.2. Highly entangled eigenstates and random 
Hamiltonians 

As we have seen in the last section, highly entangled energy eigenstates are suffi- 
cient for initial state independence. However, this intuitively seems to be a very 
special property presumably not found in most realistic systems. It is however 
possible to show that almost all random Hamiltonians actually have highly entan- 
gled eigenstates. Where almost all has a mathematically precise and well defined 
meaning. 

The argument is based on the fact that random pure states are highly entangled 
with very high probability: 

Lemma 2.8.1. Given the Hilbert space H = Hs ®'Hb of a composite system with 
ds ^ ds- The reduced state = Ttb4' of a random pure state ip G Vi{'H) is 
with very high probability highly entangled in the sense that 



Pr > e| < 2 (^^^J e-^'^'^^ , (2.8.14) 

where C = (14 ln(2))-^ 

Proof. The lemma is a direct corollary of lemma III. 4 in [66j, which establishes 
a bound on the probability that one of the eigenvalues Aj of the reduced state 
= Ttb'iP oi a. random pure state ip G Vi{T-L) differs from 1/ds by more than 
e/ds, namely: 

where C = (14 ln(2))^^. If non of the eigenvalues of TiBip differs from l/dg by 
more than e/ds then ,1/ds) < e. □ 

Before we can proceed we must specify what we mean by random Hamilto- 
nian. A random Hamiltonian is a hermitian matrix whose eigenbasis was chosen 



according to the unitary invariant Haar measure, We do not put any special 



After finishing this section of the present work related considerations, but with a quite different 
intention, were published in |67| . 
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restrictions on the eigenvalues of the Hamiltonian except that we assume that the 
energy gaps are non-degenerate and the spectrum bounded. A uniform random 
orthogonal basis for "H = "Hs ® which we think of as the eigenbasis of such a 
random Hamiltonian, can be constructed by applying the same random unitary 
transformation, chosen according to the Haar measure of the unitary group, on 
every element of an arbitrary initial basis. The eigenvectors \Ek) of such a basis 
thus each look exactly as if they where random vectors in "H. Using the union 
bound and lemma [2.8.11 we therefore find that{^ 

Lemma 2.8.2. All eigenstates \Ek) of a random Hamiltonian on 1-1 = Hs^Hb 
with ds ^ ds are with high probability close to maximally entangled in the sense 
that 

Pr !^3k : V{Ttb \Ek){Ek\, > e| <2d(^^^ ' e-^^-^^ (2.8.16) 
where C is a constant with C = (14 ln(2))~-'^. 



2.8.3. Towards a probabilistic quantum Second Law 

The Second Law of Thermodynamics is probably one of the most mysterious pos- 
tulates ever made to justify a physical theory. There are many different versions 
of it, beside others there are versions due to Clausius, Kelvin, Planck and Boltz- 
mann, and it is not easy to see how exactly they are related or whether they are 
equivalent. One might even be tempted to say that there is not such a thing as 
the Second Law of Thermodynamics. Basically the only obvious feature shared 
by all these Second Laws is that they introduce some sort of irreversibility. Irre- 
versibility is an obvious property of many processes in our everyday world. But 
exactly this irreversibility is in conflict with the time reversal invariance of all 
microscopic theories [31 [D EH]- 

Trying to derive the Second Law from Quantum Mechanics thus seems to be 
a hopeless endeavor. However, if we take (i) a tendency to equilibrate and (ii) 
a tendency to increase disorder as measured by some entropic quantity as the 



most important aspects of the Second Law, then theorem 2.3.1 and 2.4.2 together 



^^This simple proof of lemma 2.8.2 was suggested by Andreas Winter after the author had 
established a slightly weaker statement with a much more involved proof. 
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with theorem 2.8.1 and lemma 2.8.2 are sufficient to derive a probabihstic pseudo 
Second Law from just standard Quantum Mechanics: 

Theorem 2.8.2. (Probabilistic pseudo quantum Second Law) Given an 
arbitrary fixed pure initial state of a large bipartite quantum system "H = Hs ^'Hb 
with ds <^ ds- The time evolution of almost every random Hamiltonian is such 
that the reduced state on S is close to an equilibrium state for almost all times. 
This equilibrium state does not depend on the initial state of the system and 
maximizes the local von Neumann entropy. 



Proof. Theorem 2.8.1 proofs initial state independence if the eigenstates of the 



Hamiltonian are highly entangled and lemma 2.8.2 shows that this is the case for 



almost all random Hamiltonians if ds ^ ds- The randomness of the initial state 



needed in theorem |2.3.1| can be absorbed in the randomness of the eigenbasis 
of the Hamiltonian. A fixed initial state has a high effective dimension with 
respect to almost every Hamiltonian. Under this condition theorem 2.4.2 ensures 
equilibration of all subsystems with ds ds- □ 

First of all it is crucial to note that the theorem stated above is a statistical 
assertion. It does not deny the possibility that a system near the completely 
mixed, equilibrium state suddenly becomes purer and therefore is immune to 
both the reversibility paradox raised by Loschmidt and the recurrence objection 



raised by Poincare against Boltzmann's famous H-Theorem [11 [58] (Figure 2.3). 



Theorem |2.8.2| states that a system started in a pure state will have a tendency 
to evolve towards less pure states, thereby increasing the systems von Neumann 
entropy. This is what Ehrenfest would have called a statistical H-Theorem [H 
12]. Such a statistical assertion is the strongest that is compatible with a time 
reversal invariant microscopic theory and therefore the best we can hope for in 
the framework of Quantum Mechanics (again see jT], especially chapter 4). 

As said above there exist various versions of the Second Law, but they all apply 



to thermally isolated or closed systems, while our theorem 2.8.2 is a statement 
about small subsystems of large fully interactive quantum system. Although 
these canonical versions of the Second Law obviously contradict standard Quan- 
tum Mechanics a legitimate question might be: What is the connection between 



theorem 2.8.2 and the canonical versions of the Second Law? According to the 
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Figure 2.3.. Schematic particles-in-a-box pictures depicting Loschmidt's time re- 
versibility paradox and Poincare's recurrence objection, which show that 
conventional Second Law of thermodynamics and in particular Boltz- 
mann's H-Theorem contradict time reversal microscopic dynamics [V, '5SJ • 
The former (a) is an argument which shows that for every initial non- 
equilibrium situation of a time reversal invariant system that evolves 
towards equilibrium there is an initial equilibrium situation, that can be 
constructed from the former by inverting the time evolution (which is 
equivalent to reverting all momenta in the classical setting) , that evolves 
out of equilibrium. The later (b) is a theorem which shows that certain 
energy conserving systems recur arbitrarily close to their initial state 
after a possibly very long time. 



interpretation of the system-bath setup this question can be answered in two 
different ways: 

First, one might argue that it is generally impossible to isolate a macroscopic 
system from its environment and think of the bath as the laboratory and the 
subsystem as the thermodynamic system under consideration. Of course the 
Hamiltonian that describes such a situation is most likely not of the random form 
for which our theorem holds. 

Second, taking into account that a realistic measurement on a macroscopic sys- 
tem, such a measurement of the pressure in a gas container or the magnetization 
of a macroscopic magnet, usually act only on a small part of the whole system (see 
for example P] especially p. 306 and [21J). In the first example only the average 
momentum of the atoms that hit the detector membrane during the observation 
time is measured. In the second example the observable is a sum of operators that 
measure the magnetization of each individual magnetic moment in the magnet 
and thus a sum of local observables acting on reduced subsystem states of a large 
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interacting system. One may then think of the joint system as the system under 
consideration (the gas in the container or the magnet) and may assume perfect 



isolation from the environment,^ This leads to a highly speculative point of view, 
namely that our impression that large isolated systems tend to equilibrium might 
just be an illusion that arises from the fact that when we think that we measure 
properties of macroscopic systems in reality only a relatively small subsystem is 
measured. From finding such subsystems in an equilibrium state we spuriously 
infer that the whole system must be in an equilibrium state too. This conclusion 
would be correct in a classical world, but, as can be seen from the discussion 
above, this is not necessarily correct if the system is quantum. 



Theorem 2.8.2 is somewhat stronger than what one would have wanted to show. 
It implies for example that by measuring the subsystem we can get no information 
about the total energy of the composite system at all. The reason for this is simply 
that realistic Hamiltonians usually comprise only short range interactions such 
that energy is an extensive quantity and that the decomposition of the joint 
system into a bath and a system usually corresponds to a division of the whole 
system in two spatially disjoint regions. Mathematically typical Hamiltonians are 
not necessarily realistic Hamiltonians. 

It is therefore of outstanding importance not to misinterpret the above result. 
By making the above statement the author does not want to imply that random 
Hamiltonians are in any way realistic. The author is well aware that this is not the 



case. However theorem |2.8.2| shows that there is a natural tendency to approach 
equilibrium and to maximize entropy. Traces of this tendency are expected to 
be found also in more realistic situations. The result raises the hope that by 
imposing further constraints on the Hamiltonian, like finite interaction range, 
extensivity of energy, or conservation of certain quantities, one might be able to 
proof a theorem that comes closer to a realistic Second Law of Thermodynamics 
than the one presented above. 



Although the very fact that the gas is trapped in the contained imphes that there must be 
some form of interaction. 
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We have made an attempt to rebuild the foundations of Statistical Mechanics 
and Thermodynamics form an underlying microscopic theory, namely Quantum 
Mechanics. Instead of relying on additional postulates we seek for a justification 
of the methods of Statistical Mechanics from first principles. Our approach is 
genuine quantum as randomness and statistical behavior emerge as a consequence 
of uncertainty relations and entanglement with the environment. 

The approach gives a measure theoretic justification for the microcanonical and 
canonical ensemble and is capable of explaining the tendency to evolve towards 
equilibrium in a natural way. New bounds on the time scales on which equili- 
bration happens have been obtained. We have identified a generic decoherence 
mechanism that makes the states of systems that interact weakly with an en- 
vironment become approximately diagonal in the energy eigenbasis and we have 
derived a Second Law of Thermodynamics from Quantum Mechanics. In addition, 
the measure theoretic foundations of the approach are strengthened by giving new 
bounds on the average effective dimension in the mean energy ensemble and for 
initial product states. 
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A. Distance measures for quantum states 



In this work we make use of a couple of different norms and distance measures for 
quantum states. The most important of which is the trace norm and the trace 
distance. 

Let p G M.{'H) be a normahzed density matrix with eigenvalues {pk} and 
d = dim(?^). Its Li-norm, or trace norm, of p is defined to be 



IpIIi = Tr \p\ = Tr[v^] = ^ \p,\. (A.0.1) 



The trace distance is proportional to the metric induced by this norm 

T^ip, ^) = ^IIp - ^lli = ^ Tr \p - a\ (A.0.2) 
= max TrM(p - a)] (A.0.3) 

= max Tr[n(p-a)l. (A.0.4) 

nev(H) 

That the definitions given above are indeed equivalent can be seen as follows: 
[p — o") is a traceless hermitian operator, the normalized hermitian observable A 



that maximizes (A.0.3) thus is the projector onto the positive subspace of (p~a), 
which is in turn equal to 1/2 Tr |p — (t| [62j. 

Equation (A.0.3) shows what makes the trace distance T>{p, a) so special among 
other possible distance measures: It can be interpreted as the physical distin- 
guishably of p and a. If two states are close to each another with respect to trace 
distance there is no measurement by which they can be distinguished. 

As the trace norm and the trace distance depend only on the eigenvalues of their 
arguments it is manifest that both are invariant under unitary transformations 

yUeSU{d): V{U pU\UaU^) = V{p,a). (A.0.5) 
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A. Distance measures for quantum states 

In fact, the trace norm is the largest unitary invariant norm in the sense that 
infill < ll^lli foi' ^-ll hermitian operators A and all unitary invariant norms ||| - ||| 

m- 

If two density matrices p and a with eigenvalues {p^} and {g^} commute, the 
trace distance reduces to one half of the classical Li-distance of their spectra [62] 

[p,a] = =^ P(p,a) = ^ (A.0.6) 

k 

and for pure states the trace distance is related to the Hilbert-Schmidt norm via 

= v/l-|(^|^)P < 111^) - |^)||2, (A.0.7) 
while for mixed states one has |25| 

V{^,^) < ^VdTr[(p-a)2]. (A.0.8) 
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B. The Haar Measure 



A Haar measure is a Borel measure in a locally compact topological group 
X, such that ^{x B) = ^{B) for every x G X and every Borel set i? C X and 
> for every non-empty Borel open set B. Equivalently one can say that a 
Haar measure is a left invariant Borel measure that is no n- vanishing |70] . 

Of particular importance in the context of Quantum Mechanics is the special 
unitary group in d dimensions SU {d). It can be shown that on SU (d) there exists 
a left and right invariant measure ^su{d) which is unique up to normalization [70| . 
The normalization can be chosen such that lJ'SU{d){SU{d)) = 1. This measure is 
what we call the Haar measure on the special unitary group. 

The measure ^su{d) induces a "uniform" measure on the set of pure quantum 
state vectors oi a. d dimensional quantum system in the following way: Choose 
an arbitrary fixed pure reference state and apply random unitary transformations 
from ^su(n) to it. Due to the left invariance of fisu{n) the resulting distribution 
of pure states will also be invariant under unitary transformations and therefore 
is "uniform" on the set of pure states. 

For actually sampling random pure states the above construction is not very 
useful as sampling random unitary transformations from fisu{n) is quite expensive. 
Fortunately there is a much easier method to construct "uniform" random pure 
states of ad dimensional quantum system: Choose the entries of a2d dimensional 
real vector x according to the normal distribution. Normalize x such that ||x|| = 1 
in the standard euclidean norm || ■ ||. Use the 2d entries of x as the real and 
imaginary parts of the d complex expansion coefficients of the random vector 
with respect to some arbitrary fixed orthonormal basis. 
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C. Levy's lemma and its application in 
Quantum Mechanics 



Levy's Lemma is a measure concentration result useful in high dimensional ge- 
ometry. It has previously been used in the context of Quantum Mechanics in [T5 
and 1251. 



Lemma CO. 3. Levy's lemma [71| Let / : S"^ ^ H be a real valued function 
on the {d — 1)- dimensional Euclidean hypersphere with Lipschitz constant 

,= sup ^^fJzif^^ (C.O.l) 

where \\ ■ \\ denotes the standard euclidean norm. Then, for a uniformly random 
point X G S'^^^, 

Pr{|/(f)-(/)|>e}<2e-'^ (C.0.2) 

where (/) is the expectation value of f for uniformly random vectors from S"^^^ 
and C is a constant with C = (9 7r'^)^^. 

To see how this purely mathematical lemma can be applied to Quantum Me- 
chanics note that normalized pure states chosen from some subspace with 
dimension rf/j can be thought of as lying on a (2 — l)-dimensional hypersphere 
with coordinates given by the real and imaginary components of the expansion 
coefficients with respect to some orthonormal basis {\i)} of T-Lr 

X2^{^) = Re[{i\^)] X2^+i{ij) = lm[{tm (C.0.3) 

and that in this coordinate system the standard Euclidean metric on R^'^-f' and 
the metric induced by the standard Hilbert space norm coincide in the sense that 

||Xl-X2|| = |||^l)- 1^2)112. (C.0.4) 
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